Geetanjali Chawla’s Post

View profile for Geetanjali Chawla, graphic

Software Developer @ Gamemano | MERN stack developer

BookMyShow's platform crashed today during the Coldplay India 2025 ticket sale 🚨 This highlights a major issue many companies face—managing high concurrency during sudden traffic surges. As developers, it's crucial we focus on building robust, scalable systems to handle such spikes and avoid crashes. Here are key areas we need to focus on to future-proof our platforms: 1. Scalable Infrastructure: Utilize cloud-based auto-scaling to manage traffic fluctuations seamlessly. 2. Load Balancing: Implement tools like Nginx or AWS ELB to distribute traffic evenly across servers. 3. Distributed Systems: Shift to microservices architecture for flexible, independent scalability. 4. Caching: Leverage Redis or CDNs to reduce server load by caching frequent requests. 5. API Rate Limiting: Implement rate limiting to cap API requests and prevent overload. 6. Load Testing: Regularly simulate high-concurrency scenarios using tools like JMeter. In a world where millions of users can flood your platform at any moment, building for scale is non-negotiable. Let’s start preparing today to avoid future meltdowns. #BookMyShow #Zomato #Paytm #JavaScriptMastery #freeCodeCamp #WebDevelopment #Cloud #ScalableSystems #TechTalk

Mutha Nagavamsi

Kubernetes, Devops, Cloud & Tech. Trying to be useful. Learning everyday, sharing important pieces here & in the newsletter, do join. 75K+ strong all socials 💪

3mo

Is there a official blog post explaining the situation at bms? I think they must be using everything you mentioned here. It's a pure cost vs user experiance problem. Viral traffic is always hard to handle unless there is optimization at infra later, app layer and minimum budget support required to facilitate scaling in cloud.

Shyam Pradhan

Senior Analyst at TIAA | Software Engineer | SRE

3mo

In many cases, companies already have scalable infrastructure, load balancing, and distributed systems in place. The real challenge lies in predicting and handling unpredictability of incoming traffic.

Aatif H.

Software Architect at Wipro Ltd. | Python | Java | Spring Boot | Spring Cloud | Microservices | REST | Kafka | MongoDB | Agile |

3mo

BookMyShow engineering team will be overwhelmed if they come across all the above points and must scratch their heads, how can we be so apprentice and forget to consider all these points in the first place while developing an e-commerce website.!! 😬

Sandeep Sahu

DevOps Engineer | Jenkins | Maven | Jira | Confluence | Slack | Docker | Kubernetes | Terraform | Sonarqube | GitHub | Grafana | Prometheus | AWS | Gitlab CI | Trivy | OWASP | Windows | Linux | ServiceNow

2mo

Also they have used the monitoring tools like Datadog or Kibana or Grafana or Prometheus. For better observation to the nodes Geetanjali Chawla

Venkateshwaralu M

Senior Manager Site Reliability Engineer | Certified Site Reliability Engineer (SRE) | Devops | Multi Cloud | Containers

3mo

BookMyShow’s Response to the Crash In response to the technical issues, BookMyShow issued an apology on their social media platforms, acknowledging the crash and promising to resolve the issues as quickly as possible. They assured fans that more tickets would be made available once the platform stabilized and that everyone would have a fair chance to purchase tickets. BookMyShow representatives explained that the unprecedented demand for Coldplay tickets exceeded their server capacity, leading to the crashes. The company has since taken steps to upgrade their systems and manage the traffic more efficiently for future sales. https://themusicessentials.com/news/bookmyshow-coldplay-tickets/

Ajay Prabandham

Pragmatism better than Astigmatism!

2mo

Geetanjali Chawla Thanks for your post summarizing the important aspects of planning for the unplanned. However, things can, and do, go wrong - can't they? And this was a major, major outlier... do we really need to learn anything from this - apart from the ill effects of greed, marketing frenzy, media hype, etc? I think we should just shrug our shoulders, and reduce the options available - such as implementing booking rate-limiting... limit the number of concurrent users at any given time... there will be no crashes... People will then understand that population - read we, the people - is actually a serious issue...

Arun Rao

SRE Leader | Architecting Reliable, Scalable Systems | Cloud-Native, Security-Driven, Automation-Focused

3mo

This sounds like a gap in communication between business team and engineering teams. It is impossible to scale from few hundred TPS to hundreds of thousands of TPS within minutes. If the infrastructure (rate limiting) is not setup correctly, this will take down existing infrastructure.

Abhirup Ghosh

Software Production Stability Expert | Full Stack Support | Tech-Lead | JAVA | Microservices I SQL | UNIX | Ex-LTI | Ex-JIO |

2mo

I believe we have robust cloud platforms nowadays which specifically addresses this exact issue - BookmyShow team should have anticipated this beforehand and gone for an upgraded cloud plan 😊

Mohd Mubin Girach

DevOps Engineer at NIQ BrandBank | Tech Enthusiast | RedHat and Microsoft Certified | OpenShift | Kubernetes | Azure | Amazon Web Services | Cloud Engineer | Security Researcher | Perpetual Leaner

3mo

Kinnari Gohil you still have an opening for Site Reliability Engineer at BookMyShow :) Let's not take a chance next time!

See more comments

To view or add a comment, sign in

Explore topics