Excelent video about youtube DBA explaining performance tuning and scalability stuff. The slides are missing so if anyone has them post a comment.

In this video you can see Paul Tuckfield, talk about how they scaled Youtube, some interesting points:

Things that made youtube scale:

  • Python
  • Caching (before getting to the DB)
  • DB tuning (that's what the video is about

From the DB tuning my highlights are these:

  • Views count goes to a MyIsam DB
  • Favourites, playlists, etc goes to a DB that thens replicates to other 5, as to divide the read queries
  • Slow replication is a very big problem and is difficult to keep replication fast.
  • InnoDB doesn't recover easily and is not nice to switch to the replica because you are not sure if that's OK.
  • You can test things in one replica and if it works pass that change to the rest of them.
  • A good trick is to have a replica with some lag as a way to have a easy restorable backup.
  • When migrating to 5.0 is better to dump and load instead of updating in place.
  • To upgrade hardware means much more memory and many disk drives (even if they are small) instead of fast CPUs and big hard drives
  • If it is on cache you are ok, if your database fits in cache, go to have a coffee
  • Write Cache is OK, Read Cache is BAD in levels below the database(Linux or RAID).
  • For parallel reads is better to have big chunks
At the last part of the talk he explains some very interesting algorithms to predict the things that must be in the cache of replicas achieving excelent results.

Sorry, comments are closed for this article.

 

Search

Tags

3g (1) ads (2) apple (1) barcode (1) best practices (1) blog (2) code reviews (1) community (1) db (1) design (3) development (1) e256 (1) education (1) engine (1) event (1) flof (1) friendfeed (1) garfio (1) google (3) googlemaps (1) internet (2) iphone (1) java (1) management (1) mobile (2) mysql (1) n95 (1) news (5) nokia (1) oop (2) phone (5) phone iphone 3g apple content (1) popego (3) programming (5) python (1) qr (1) research (2) review (2) RIA (1) samsung (1) scalability (1) scm (1) ski (1) smalltalk (2) social (1) squeak (1) startup (1) svn (1) technology (3) video (1) viewty (1) web 2.0 (6) web 3.0 (2) widgets (1) xul (1) youtube (2) zauber (4)

Links

Categories

  • Home (15)
  • Archives