Using Nagios XI To Empower Your
Developers To Own Their Own Checks
Nick Winn
Email: technick@gmail.com
Twitter: techni...
Hi my name is Nick
• Been around doing IT stuff since 1999
• Seen a lot of mistakes in humongous
environments – especia...
Who do I work for?
Return Path is the global leader in email intelligence. Our products are powered with proprietary data...
Disclaimer
I am still a Nagios Ninja In
Training
I can only tell you how I have gotten things to work in my
organizati...
The Nagios XI Story
• The monitoring problems we faced as a
whole and how it slowed down release
iterations.
• How doe...
Our broken monitoring process
• Developers lacked training and access.
• Iterations moved quickly and would bottleneck w...
Solving Problems with XI
• The GUI lowered the bar of entry allowing more people
to become comfortable with Nagios XI.
...
Short Falls
• Nagios XI on it own is a great product though it was not
designed to have “many hands in the pot”.
• It i...
Making Nagios XI Work
• Communication within teams
• CCM does not lock rows in databases allowing two separate
instance...
Making Nagios XI Work
• Naming Standards
• Things can go wrong without naming standards. How do you
define what group o...
Naming Examples
• Service Checks (Both config & desc name)
• group-servicename-optional information
certification-hotma...
Naming Examples
• Service Groups
• Top level group setup to read in all sub groups.
• certification-all
• group-servic...
Benefits of proper naming
• Organization
• Guides people and are some what self
explanatory
• At a future point, more ...
Making Nagios XI Work
• Developed solid future roadmap of possibilities
• This is more of a lesson learned from my exper...
Problems…
• Manually adding and removing users because ldap / ad
integration isn’t there yet.
• Every Developer needs t...
Conclusions
• Nagios XI has empowered large groups of developers
in my company to manage and own service checks
related...
Developer Feedback
• Can more quickly add alerts to our products without
having to wait for a free OPs resource
• More ...
Other Thoughts
Questions?
Any questions?
Thanks!
The End
Nick Winn
Email: technick@gmail.com
Twitter: technick
Nagios Forums: technick
of 20

Nagios Conference 2014 - Nick Winn - Using Nagios XI to Empower Your Developers to Own Their Checks

Nick Winn's presentation on Using Nagios XI to Empower Your Developers to Own Their Checks. The presentation was given during the Nagios World Conference North America held Oct 13th - Oct 16th, 2014 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/conference
Published on: Mar 3, 2016
Published in: Technology      
Source: www.slideshare.net


Transcripts - Nagios Conference 2014 - Nick Winn - Using Nagios XI to Empower Your Developers to Own Their Checks

  • 1. Using Nagios XI To Empower Your Developers To Own Their Own Checks Nick Winn Email: technick@gmail.com Twitter: technick Nagios Forums: technick
  • 2. Hi my name is Nick • Been around doing IT stuff since 1999 • Seen a lot of mistakes in humongous environments – especially around monitoring. • Taken down my fair share of production networks – opps, my bad. • Believes in keeping everything simple and never repeating yourself. (K.I.S.S.) • Hasn’t stopped learning… • Has used Nagios since 2007
  • 3. Who do I work for? Return Path is the global leader in email intelligence. Our products are powered with proprietary data that enables our clients to get delivered to the inbox (and stay there), optimize their messages to stand out from the competition and protect their brands from phishing and spoofing attacks. We maximize the performance and accountability of email, build trust across the entire email ecosystem and protect users from spam and other abuse.
  • 4. Disclaimer I am still a Nagios Ninja In Training I can only tell you how I have gotten things to work in my organization. Parts of this or as a whole might not be the right fit for your organization. Nagios XI offers you many ways to accomplish different possibilities and I have have not tried every possible solution out there.
  • 5. The Nagios XI Story • The monitoring problems we faced as a whole and how it slowed down release iterations. • How does Nagios XI solve these problems? • The methodologies and shortcuts required to make this work.
  • 6. Our broken monitoring process • Developers lacked training and access. • Iterations moved quickly and would bottleneck while waiting on monitoring. • The teams that had access to previous installation didn’t fully understand it. • The revision control system was over complicated, enough to persuade people away. • Many people were not comfortable on a command line.
  • 7. Solving Problems with XI • The GUI lowered the bar of entry allowing more people to become comfortable with Nagios XI. • Iterations were no longer bottlenecked while waiting for modifications to monitoring. • Training across the organization made people comfortable with the power of Nagios XI • Nagios XI has configuration snapshots built in, so if things went south, it was easy enough to roll back configurations.
  • 8. Short Falls • Nagios XI on it own is a great product though it was not designed to have “many hands in the pot”. • It is the same story with the Core Configuration Manager (CCM).
  • 9. Making Nagios XI Work • Communication within teams • CCM does not lock rows in databases allowing two separate instances of editing to take place, with the last update over writing the previous.
  • 10. Making Nagios XI Work • Naming Standards • Things can go wrong without naming standards. How do you define what group owns what Checks? Groups? Hosts? • Correct naming of objects save time and allows the viewer to understand what it is right away. • Will guarantee naming of objects does not clash between groups using Nagios XI. • Predictable names allow for manipulation of the CCM database and easier integration with static files. • Nagios sorts everything alphabetically (0-9,a-z), which can be used in our favor for readability.
  • 11. Naming Examples • Service Checks (Both config & desc name) • group-servicename-optional information certification-hotmail_feed-fbl certification-message_parser_backend-00 webintegration-http_available-p8210 webintegration-https_sslexp-partnersite.com dbo-tnsnames ops-ssh-p2200 ops-disk
  • 12. Naming Examples • Service Groups • Top level group setup to read in all sub groups. • certification-all • group-servicename-optional information certification-hotmail certification-parsers certification-activemq
  • 13. Benefits of proper naming • Organization • Guides people and are some what self explanatory • At a future point, more will be done by editing the CCM database directly. • Will allow easier automated interaction with Nagios XI
  • 14. Making Nagios XI Work • Developed solid future roadmap of possibilities • This is more of a lesson learned from my experiences with Nagios XI and how things can become a headache. Let me explain…
  • 15. Problems… • Manually adding and removing users because ldap / ad integration isn’t there yet. • Every Developer needs to be an administrator of Nagios XI and CCM • Developers can control the monitoring engine and reset / change other users credentials.
  • 16. Conclusions • Nagios XI has empowered large groups of developers in my company to manage and own service checks related to their platform. • Developing a standards system inside Nagios XI has enabled my team to trust our developers with this task that has been traditionally limited to operations or application support. • Nagios XI can be used as a efficient component for stream lining the release process that’s required of agile shops.
  • 17. Developer Feedback • Can more quickly add alerts to our products without having to wait for a free OPs resource • More flexibility in determining the alerting procedures. Better insight into what, when, and how we alert. • It is nice to have developers able to manage alert setup and maintenance windows. • It's nice to not worry about conf files. • Broader interaction with development teams and learning how we monitor things • Much easier to visualize creating the checks through the Nagios XI interface. Much easier to maintain them in this format.
  • 18. Other Thoughts
  • 19. Questions? Any questions? Thanks!
  • 20. The End Nick Winn Email: technick@gmail.com Twitter: technick Nagios Forums: technick

Related Documents