Blog

IT outages cost $14,056 per minute on average. What’s driving the increased costs? How can you use AIOps to reduce their frequency, duration, and impact?

BigPanda 24 brought together ITOps leaders from across industries to discuss the future of AIOps and IT operations. CEO Assaf Resnick shares his thoughts.

Discover how ServiceNow AIOps uses artificial intelligence and transforms incident management for smarter IT operations.

AIOps platforms bridge the complexities of modern IT environments and the need for streamlined, effective incident management.

DevOps practices in software development have revolutionized the way updates are released. However, many companies entrenched in ITIL practices find it challenging to seamlessly integrate with the DevOps practice of Continuous Integration and Continuous Delivery/Deployment (CI/CD). This is because ITIL focuses on stability, which suits older systems, while DevOps is ideal for modern setups with […]

IT leaders are thrilled about the potential of Generative AI for IT Operations. But they also want to know how it works, why it works, and what it will do for them before taking the leap and adopting this new technology. Allow me to share my perspective on the hype and the truth behind generative […]

Finding the root causes of IT anomalies can be challenging, but the rewards are worth it. By identifying the root cause or causes of an incident or critical failure, response teams can resolve incidents faster and determine the best steps to avoid having them recur. This can drive down both the frequency of service interruptions […]

Today, the majority of organizations operate under a hybrid cloud structure. Due to this, operations are consistently met with daily infrastructure and software changes and updates, which are also the primary cause of incidents and outages. Long gone are the days when a tech stack could be represented by a single dependency model. Microservices, CI/CD, […]

IT response teams find themselves battling against an overwhelming onslaught of incidents. Frustratingly long response times, challenges with prioritization, and the relentless pursuit of root cause are formidable adversaries that test even the most skilled teams. I remember customers’ electrifying anticipation with AI and automation a decade ago. They hoped AI could be used to […]

Beyond being the right thing to do, fostering inclusivity in the workplace empowers employees to show up fully as their authentic selves. “BigPanda is the first shop where I have openly identified as non-binary because I felt safe enough to,” said one employee. And considering how much of our life we spend working, BigPanda knows […]

What will 2023 look like for the industry? Our panel of industry experts share some insights to help you navigate the choppy waters.

Explore these use cases for AIOps to see how you can improve productivity, job satisfaction, and service quality from your ITOps teams.

The success of CI/CD depends on collaboration between IT Operations managers, DevOps engineers, and other stakeholders. Learn why CI/CD is important and why.

Whiskey and Wisdom is a monthly executive-only forum where ITOps leaders can network independently and discuss high-level AIOps and ITOps strategies with their industry peers.

How much do you know about AIOps and event correlation? In this post we explore everything from its origins to current state-of-the-art techniques and how it fits into integrated service management.

In our first session from RESOLVE ‘22, we were honored to have Darren Boyd and Satbir Sran from the Incubator podcast and ink8r think tank talk observability and AIOps with BigPanda’s Aaron Johnson. Both panelists are part of communities adopting open standards, and they regularly consult with organizations about how they can improve IT Operations […]

Tracking IT incident management key performance indicators (KPIs) is a vital step toward minimizing disruptions for customers and users. But metrics alone are not enough to drive improvement in incident management. Learn what KPIs to monitor and how to use them.

Integrating all of your monitoring alert sources is quite a task. Large enterprises often struggle to aggregate millions of data records from dozens of monitoring, change, and topology tools in real-time. Filtering out the noise and prioritizing the most important alerts are crucial to a team’s success. BigPanda makes it simple to integrate with any […]

The amount of data volume and complexity within tech stacks is continuing to increase with no sign of slowing down. As a result, many organizations are facing significant challenges related to tool sprawl and the overwhelming amount of data that needs to be exchanged between all the different systems. The result is this new rapid […]

It feels a bit surreal stepping into the Regional Vice President of Sales position here at BigPanda just a few months after the company achieved Unicorn status. In more than 15 years of managing enterprise software sales, this is the first time I knew I was going to play a critical role in facilitating a […]

Whiskey and Wisdom is a monthly executive-only forum where IT Operations leaders can network independently and discuss high-level AI operations and IT Ops strategies with their industry peers. In our most recent session, the discussion was around justifying AIOps—proving the value the technology brings to the table. Demonstrating ROI on AIOps tools requires its champion […]

CIO 100 is one of the most prestigious and coveted awards in technology, as it recognizes IT leaders who constantly seek new ways to integrate technology, business processes, and people to drive business agility and innovation for their organizations. BigPanda nominated Sibito Morley because we admire the vision he has, the team he leads and […]

The effects of BigPanda’s most recent round of funding—amounting to $190 million—will be reverberating throughout the company for years to come. And it’s not just BigPanda employees who have experienced a surge of enthusiasm in the wake of our Unicorn status. Our customers are thrilled at the prospect of more innovation from our team and […]

We’ve recently launched an executive event series, Whiskey and Wisdom, exclusively for IT Ops leaders to gather each month and talk about topics that are top-of-mind (and yes, there is whiskey!).

While the sessions are not recorded, Derrick Arakaki, VP of Customer Outcomes, started a blog called Overheard at Whiskey and Wisdom.

It’s a great way to get a feel for what’s keeping IT leaders up at night, and what ideas they are sharing with their peers.

Every IT Ops team needs to set KPIs, but it can be difficult to know which ones to use, how many to set, and how to derive value from them. In a recent conversation at our Bamboo Lounge community event, we talked with customers about how they tackle setting KPIs.

I am excited to announce today that BigPanda has secured $190 million in financing at a $1.2 billion valuation. This financing was led by Advent International and Insight Partners, together with our other existing investors. BigPanda is now officially a unicorn, and the clear leader in the rapidly growing AIOps market! Keeping the digital economy […]

Sharmila S., Senior Manager of the Network Operations Center at LogMeIn In the year since being named an Incident Commander, Sharmila S. — like many — was tasked with helping implement a work-from-home policy during the global pandemic. This, in fact, turned out to be one of her greatest successes over the past year. “It […]

David Levinger, Head of Operations at Machinify Since being named a 2020 Incident Commander, David and his team have worked through a familiar refrain: learning to remain productive in the era of remote workforces. His team has tackled the many challenges that arose in this new paradigm, such as keeping internal and external teams aligned. […]

This year we honored 10 of our top performers by giving back to their communities.  We asked each of our Panda Award Winners to choose an organization that we could give a donation to on their behalf. Our top 10 selected 12  amazing organizations. We were able to give $10,000 to our Pandas communities through […]

By Rob Schnepp, general partner at BlackRock 3 Partners, and Jason Walker, field CTO at BigPanda Downtime. It’s more than just a bar on the Rebel Alliance’s base on Folor. For IT Ops teams, downtime is not fun. It costs time, money and often, user frustration. It takes more than the Force to handle incidents…it […]

The future is here. Enter BigPanda and AIOps. BigPanda takes a tools agnostic approach that enables distributed teams to leverage the solution most effective for their needs, while providing IT Ops a centralized and unified manner to standardize incident management processes.

Kelly Looney, Global DevOps Lead at AWS, discusses the challenges IT Ops teams face as they embrace the cloud & why AIOps tools like BigPanda can be a solution.

Being an intern at BigPanda can be one of the most rewarding experiences you can imagine. Anthony Caldiero, who joined our Sales team for the summer, shares his experience.

You’ve just recovered from a critical application outage and your team is being asked to report on root cause and recommended remediation steps later this afternoon. Can you quickly analyze all the data, identify all the leading events, and discern which one was responsible for the cascading failure? Later that week, you are back to […]

IT operations teams have some of the most stressful jobs in IT. Keeping data centers online, servers running, enterprise systems functioning, and applications performing — all while responding to incidents and requests is hard work. While there are monitoring systems in place to provide visibility and change management practices that give IT some control over the […]

This is the first in a series of blog posts on Open Box Machine Learning. If you’re part of a large enterprise, you’re probably in the throes of digital transformation. If you’re in IT, you’re supporting your business by rolling out new services and apps weekly (or even daily). Meanwhile, your users expect 24×7 availability […]

It was a big week for BigPanda at the Gartner IT Operations Strategies & Solutions (IOSS) Summit, which was held May 15-17th in Orlando, Fla. More than the Big News we announced and our CEO Assaf’s presentation, we enjoyed more than 700 high-quality interactions at Gartner IOSS Summit with senior enterprise IT leaders. Gartner’s theme this […]

Orlando, here we come! BigPanda is excited to be a Platinum Sponsor of the upcoming Gartner IT Operations and Strategy Summit, kicking off in sunny Florida on Tuesday, May 15th. We’re even more thrilled that our CEO Assaf Resnick has been chosen to be among the prestigious list of Gartner IOSS Speakers. Gartner’s theme this […]

We’re happy to have Rise Broadband as one of our customers. We’re especially thankful for their willingness to share their story of how they overcame NOC challenges to improve service availability, gain better visibility over operations, and boost the productivity of their NOC team. If you’re not familiar, Rise is the largest fixed wireless service […]

Any organization can be defined by its operating principles. These are the fundamental norms, rules and values that represent what is desirable and positive for the group. Having well defined principles can help an organization operate as a “community” with a shared understanding of what is right and what is wrong. It’s key that these […]

For the past year or so, productivity experts have been talking about “Inbox Zero” – a rigorous fresh approach to email management. Reaching Inbox Zero stresses techniques to tame your inbox and keep it empty (or nearly empty) at all times. The practice promises to focus your time and attention on the most important tasks. At an […]

BigPanda exhibited and moderated a customer success panel at Gartner’s big IT Infrastructure, Operations Management & Data Center show in Las Vegas last month. The Gartner I&O Conference is action-packed, so it’s taken me awhile to digest all the great information and insights I gathered there. Gartner’s influence in the enterprise IT operations market is undisputed. […]

One of the world’s largest television and digital entertainment companies had a problem. A company-wide initiative to reinvent its service delivery demanded that IT help make its content available anytime, on any device. As a global media conglomerate, its enterprise spans numerous entertainment, news and sports networks – including broadcasts of major league basketball. Customer […]

Part 1 of this series defines algorithmic alert correlation and how it works. The term “algorithmic” describes how data science applies machine learning techniques to solve alert storms, aka alert floods. There are two flavors of machine learning currently being applied to this problem: one is “black box” and the other, “open box”. BigPanda applies open […]

The IT Operations tool stack is becoming exponentially more complex. This requires the utilization of a breadth of diverse monitoring tools in order to quickly detect and ultimately resolve critical issues before they can inflict real damage on the business. Most large enterprises already have a host of preferred monitoring tools installed and working. It […]

In his research note “Four Steps to Turbocharge Your Major Incident-Handling Capabilities”, Gartner analyst Kenneth Gonzalez makes a compelling argument for why enterprise IT service operations teams should upgrade their incident management workflow processes. Here’s BigPanda’s perspective on the topic. The Real Challenge: Most NOCs Aren’t Automated Most enterprises are undergoing some form of digital […]

What a week! Our team spent 5 days in Orlando last week, representing BigPanda at both Gartner’s IT Operations Strategies and Solutions Summit and ServiceNow’s Knowledge17 conferences. Here’s our wrap up on Gartner IOSS. Hundreds of enterprise IT leaders gathered in at the Hilton Orlando this past week to learn how IT can take advantage […]

Is your team ready for 2017? Featuring early release findings from BigPanda’s forthcoming State of Monitoring Report, our latest e-book takes a look at how key industry trends will affect IT operations in the upcoming year.

Life just got a whole lot simpler. BigPanda is pleased to announce the launch of our upgraded  BigPanda FAQs. The new location, look, and feel of the site make it easier than ever to access the information you need to be successful with BigPanda. Here are a few benefits of the upgrade:

Decompressing from an exhausting, inspirational few days at Knowledge16, the annual ServiceNow event…

From humble beginnings (my first Knowledge was a few hundred attendees in a tent in San Diego), Knowledge has become a global tour de force. This year, Mandalay Bay could barely contain more than 11,000 customers and partners (and the expo hall could barely contain more than 100 decibels of the tech equivalent of Queensryche). Getting into the keynote felt like rush hour on the subway in midtown Manhattan. 

Ask yourself these questions to find the right fit in an alert correlation platform.

To maintain operational visibility in modern IT environments, companies are abandoning monolithic monitoring solutions from legacy vendors in favor of a modern set of “best of breed” monitoring tools. Today’s average IT monitoring stack consists of about 6-8 tools, including at least one from each of the following categories: systems monitoring, end user monitoring, application performance monitoring (APM), error detection, log analytics, chat, and ticketing. When service disruptions occur, operations engineers face a flood of alerts across different layers of the IT stack, with no fast way to figure out what’s really going on. Customers are left stranded, while IT professionals struggle to detect, triage and remediate urgent issues. Downtime abounds which negatively impacts revenue, performance, and brand loyalty.

This post was recently published as a guest blog by our friends at Jira Service Desk. You can find the original post here.

We all need to move fast in order to stay competitive. But the faster things move, the faster things break.

While many companies have made great strides towards automating application release and infrastructure management, automation for service assurance has been sorely lacking. That’s left Dev and Ops with a problem: how to effectively service alerts that have grown by orders of magnitude.

You and Nagios have had your share of ups and downs, but lately it just hasn’t been the same. You’re busy with other systems and having less patience to deal with the constant nagging from good ol’ Nagios. She’s beginning to sound more and more like a broken record.

If you keep going down this path, your MTTR will suffer. Or worse, you won’t be able to satisfy your customers. But you’re not ready to give up on Nagios just yet – after all, she’s been there when you needed her.

For many IT and Ops teams, Nagios is both a blessing and a curse. On the one hand, Nagios gives you near real-time visibility into the inner workings of your IT infrastructure. But on the other hand, Nagios can generate so many alerts that it’s impossible for any single person (or even any team) to keep up.

This is part two of a two-part post about using event correlation to thwart DDoS attacks. Channeling Mark Twain: it would have been shorter if I had more time. In the last post I described why DDoS attacks for SaaS providers are no different than performance and availability issues experienced in other domains like healthcare, finance, or retail. In this post I’ll share a customer story about a security breach that never happened… thanks to a savvy DevOps team and data science.

Every company’s a target, every customer’s at risk. But the now-cliched threat of data breaches from Distributed Denial of Service (DDoS) attacks obscures a bigger threat: outages that impact not just data integrity but also profitability, brand equity, and customer retention. 

The volume of attacks is growing and so is the impact of down time. According to Akamai’s most recent State of the Internet report, DDoS attacks are a bigger threat than ever before. “The number of DDoS attacks continued to increase substantially in Q2 2015, more than doubling the number observed in Q2 2014.”

At BigPanda, we’re committed to giving you the tools and information you need to be successful. In keeping with this goal, we’re excited to announce BigPanda Docs, our revamped help documentation that features more content, better navigation, and more ways for you to give us feedback. 

Enterprise application and computing environments have changed radically over the past fifteen years. Anyone who has spent even a day in an IT role can tell you that.What gets less attention, however, is how those changes undermine the ability of operations teams to do their jobs. The problem is that as computing and application environments have changed dramatically, workflows and org charts have not.

Data center growth over the last 15 years has created significant growing pains in terms of data center management.  Tasks that once could be done manually by IT teams have hit the limits of scalability, cost, and efficiency.  The key to enabling IT to meet these challenges involves one key theme: automation.

Ansible is a great automation tool. We use it for server provisioning, application deployments and running maintenance scripts. One problem it does have however, is how (in)convenient it is to run playbooks as opposed to regular shell scripts. Write and run enough Ansible playbooks, and eventually you’ll get tired of the repetitive typing your fingers have to do.

Modeling your production environment correctly is very important for development. Developers need to be able to run and test their code locally for the development process to be efficient, and many times this requires setting up infrastructure that exists in production on their local machines. The basic solution is a simple Vagrant box containing all your infrastructure and application code, like the one we mentioned in our Devbox post. 

Anomaly detection for monitoring has been a trending topic in recent years. And while the math behind it is fascinating, too much of the discussion has revolved around histograms, moving averages and standard deviations. More discussion needs to happen around its practical applications, and for that reason, this practical guide to anomaly detection will attempt to provide an actionable overview of current off-the-shelf anomaly detection tools.