There were two areas where I was over the free quota: the database was too large (the free quota is 1GB), and I was using to much CPU for the database requests.
Reducing the database size
The database was taking more than 1GB because I never deleted any of the submitted scores/replay. I wrote an handler that erased the lowest scores. Erasing ~400k scores cost me approximately 4$.If you have a database with tens of millions of entries, doing operations on them can become expensive!
|  | 
| The handler could only erase 1000 entries at a time before timing out, so I "automatized" its call. | 
Reducing the CPU usage
I thought optimizing the CPU usage was just going to be a matter of caching the queries. Turns out, there's a right way and a wrong way to do that.Initially, the scores for the 8 game modes were generated this way:
for level in levels:
  query_str = get_query(level)
  scores = db.GqlQuery(query_str)
  for entry in scores:
    #write the score, player_name
And to cache the results, I did this :
for level in levels:
  query_str = get_query(level)
  scores = memcache.get(query_str)
  if scores is None:
    scores = db.GqlQuery(query_str)
    memcache.set(query_str, scores, 60 * 60 * 24)
  for entry in scores:
    #write the score, player_name
But doing this did not improve my CPU usage, and profiling my app showed that I was still doing a lot of database call, even though none appear in my code. I assume I was not caching the entries, only references to the results. Reading the entries still required doing some database calls:
The correct way to do things is to manually put in the cache the strings and numbers you need to store, not references to the entries in your database.
Once this was done, I got the expected 8 memcache calls and divided the response time by 40:


 
 
