We’ve done a lot of work this year on improving the availability of our Service. Over the past several months we have moved to completion of that in parallel with our scalability project and have reached a point of stability and the highest level of service we can think of in our architecture.
This means you and your runners can use our site all the time!
Of course we are not perfect, but we did pretty well in 2012. During the first 6 months we had 14 HOURS of downtime. All of that downtime would have been eliminated with the changes we have made in the last 6 months. In the second half of the year we had 36 MINUTES of downtime. Most of that was due to either updating our infrastructure which took the service offline momentarily or because of our scalability testing when we were testing and improving our limits. We also started using New Relic in July so that we would have an outside service testing our uptime for us. Here is their report on a weekly basis:
We are striving for downtime of less than an hour in 2013. Let’s hope we meet that goal for the benefit of all of us!