Ask yourself these questions to find the right fit in an alert correlation platform.
To maintain operational visibility in modern IT environments, companies are abandoning monolithic monitoring solutions from legacy vendors in favor of a modern set of “best of breed” monitoring tools. Today’s average IT monitoring stack consists of about 6-8 tools, including at least one from each of the following categories: systems monitoring, end user monitoring, application performance monitoring (APM), error detection, log analytics, chat, and ticketing. When service disruptions occur, operations engineers face a flood of alerts across different layers of the IT stack, with no fast way to figure out what’s really going on. Customers are left stranded, while IT professionals struggle to detect, triage, and remediate urgent issues. Downtime abounds, which negatively impacts revenue, performance, and brand loyalty.
Given that the very development of Icinga arose from the need for additional functionality in open source monitoring, it’s little surprise that the tool has become indispensable for so many IT professionals. Its configurability and flexibility allow for a sophisticated approach to monitoring, which is both scalable and extensible to large, complex environments.
Salesforce likely lost quite a bit of money last Tuesday. IDC estimates that the typical infrastructure failure costs organizations $100,000 per hour, while a critical application failure costs as much as $500,000 to $1 million per hour. Salesforce was down for over 20 hours and still continued to have service disruptions. This in turn translated to heavy financial loss for Salesforce customers worldwide, as they struggled to manage their lifeblood processes that depend on the SaaS giant. Salesforce’s reputation suffered, and the CEO, Marc Benioff, issued public apologies on social channels.
We all need to move fast in order to stay competitive. But the faster things move, the faster things break.
While many companies have made great strides towards automating application release and infrastructure management, automation for service assurance has been sorely lacking. That’s left Dev and Ops with a problem: how to effectively service alerts that have grown by orders of magnitude.
SREcon16 is a wrap, and our team had a blast at this year’s event! Both days were non-stop action: demos, discussions, and - of course - handing out our fair share of panda swag. Between the buzz on the floor and in the sessions, what topics were top of mind at this year’s show? Here are our three key takeaways:
You’ve solved your noisy alert problem with BigPanda. Now solve your noisy ChatOps problem with BigPanda and HipChat, thanks to HipChat’s new integrations platform, HipChat Connect.
If every incident update were to push a new message, your Ops chatrooms would quickly become more crowded than O’Malley’s Pub on St. Paddy’s Day. BigPanda now integrates with HipChat via HipChat Connect, so you can not only view the status of BigPanda incidents in HipChat, but also view incident details with links to relevant actions in the glance view beside the chat room.
In October 2015, BigPanda launched #FlyAboveTheNoise, a program aimed at helping IT & DevOps professionals rise above their IT alert noise. The premise was simple: Trade in your noisy IT alerts by taking a trial of BigPanda, and you get a drone.
“Fly Above the Noise was a huge success. 3,000+ respondents and millions of correlated alerts later, it’s clear that the world not only needs to be free from alert noise, but it needs drones to automate the event management process,” said BigPanda CEO Assaf Resnick.