Everything is a Freaking DNS problem - icinga http://127.0.0.1:8080/blog/taxonomy/term/1292/0 en check_graphite http://127.0.0.1:8080/blog/checkgraphite <p>During my Puppetcamp Gent talk last week, I explained how to get alerts based on trends from graphite. A number of people asked ,e how to do that.</p> <p>First lets quickly explain why you might want to do that .<br /> Sometimes you don't care about the current value of a metric..as an example take a Queing system .. there is no problem if there are messages added to the queue, not even if there are a lot of messages on the queue, there might however be a problem if over a certain period the number of messages on a queue stays to high.</p> <p>In this example I`m monitoring the queue length of a hornetq setup which is exposed by JMX.<br /> On the server runnnig HornetQ I have an exported resource that tells the JMXTrans server to send the MessageCount to graphite<br /> (you could also do this using collectd plugins) </p> <p><div class="geshifilter"><pre class="text geshifilter-text" style="font-family:monospace;"><ol><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">@@jmxtrans::graphite {&quot;MessageCountMonitor-${::fqdn}&quot;:</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> jmxhost =&gt; hiera('hornetqserver'),</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> jmxport =&gt; &quot;5446&quot;,</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> objtype =&gt; 'org.hornetq:type=Queue,*',</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> attributes =&gt; '&quot;MessageCount&quot;,&quot;MessagesAdded&quot;,&quot;ConsrCount&quot;',</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> resultalias =&gt; &quot;hornetq&quot;,</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> typenames =&gt; &quot;name&quot;,</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> graphitehost =&gt; hiera('graphite'),</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal"> graphiteport =&gt; &quot;2003&quot;,</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">}</div></li></ol></pre></div></p> <p>This gives me a computable url on which I can get the graphite view </p> <p>The next step then is to configure a nagios check that verifies this data. For that I need to use the <a href="http://github.com/datacratic/check_graphite" rel="nofollow">check_graphite</a> plugin from Datacratic ..</p> <p>Which can work with an nrpe config like<br /> <div class="geshifilter"><pre class="text geshifilter-text" style="font-family:monospace;"><ol><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">### File managed with puppet ###</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">### Served by: '&lt;%= scope.lookupvar('::servername') %&gt;'</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">### Module: '&lt;%= scope.to_hash['module_name'] %&gt;'</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">### Template source: '&lt;%= template_source %&gt;'</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">&nbsp;</div></li><li style="font-family: monospace; font-weight: normal;"><div style="font-family: monospace; font-weight: normal; font-style: normal">command[check_hornetq]=/usr/lib64/nagios/plugins/check_graphite -u &quot;http://&lt;%= graphitehost%&gt;/render?target=servers.&lt;%= scope.lookupvar('::fqdn').gsub(/\./,'_')%&gt;_5446.hornetq.docstore_private_trigger_notification.MessageCount&amp;from=-30minutes&amp;rawData=true&quot; -w 2000 -c 20000</div></li></ol></pre></div></p> <p>I define this check on the host where HornetQ is running as it then will map to that host on Icinga/Nagios rather than throw a host error on an unrelated host.</p> http://127.0.0.1:8080/blog/checkgraphite#comments graphite icinga monitoringlove monitoringsucks puppet Tue, 05 Feb 2013 09:10:15 +0000 Kris Buytaert 1076 at http://127.0.0.1:8080/blog #monitoringsucks and we'll fix it ! http://127.0.0.1:8080/blog/monitoringsucks-and-well-fix-it <p>If you are hacking on monitoring solutions, and want to talk to your peers solving the problem<br /> Block the monday and tuesday after fosdem in your calendar !</p> <p>That's right on february 6 and 7 a bunch of people interrested to fix the problem will be meeting , discussing and hacking stuff together in Antwerp</p> <p>In short a #monitoringsucks hackathon</p> <p>Inuits is opening up their offices for everybody who wants to join the effort Please let us (@KrisBuytaert and @patrickdebois) know if you want to join us in Antwerp</p> <p>Obviously if you can't make it to Antwerp you can join the effort on ##monitoringsucks on Freenode or on Twitter.</p> <p>The <a href="http://www.inuits.eu/contact" rel="nofollow">location will be Duboistraat 50 , Antwerp</a><br /> It is about 10 minutes walk from the Antwerp Central Trainstation<br /> Depending on Traffic Antwerp is about half an hour north of Brussels and there are hotels at walking distance from the venue.</p> <p>Plenty of parking space is available on the other side of the Park</p> http://127.0.0.1:8080/blog/monitoringsucks-and-well-fix-it#comments collectd devops ganglia graphite icinga monitoring monitoringsucks munin nagios rrd Tue, 03 Jan 2012 18:23:00 +0000 Kris Buytaert 1057 at http://127.0.0.1:8080/blog MonitoringForge http://127.0.0.1:8080/blog/monitoringforge <p>I found the initial announcement in my Spambox, thats where I send all the mails from GroundWork as unsubscribing from their marketing lists doesn't seem to work ..</p> <p><a href="http://www.adventuresinoss.com/?p=1117" rel="nofollow">Tarus</a> pointed it out.. he already mentionned Marketing, low community imput etc..</p> <p>Nevertheles I had a look, and my first and last impression was, NagiosForge, no active community, and empty pages on other imvho more relevant Monitoring Projects created by GroundWork employees.</p> <p><a href="http://www.johnmwillis.com/" rel="nofollow">Big John </a> coined the "to pull a GroundWork" phrase aiming at their early products, err logo placements. When <a href="http://www.inuits.be">we</a> were evaluating differen Open Source monitoring tools for our OLS paper about 18 months ago we looked at the tool and it didn't make the cut .. it failed in all areas where it claimed to be strong, ease of installation , lack of errorhandling during the installation, etc ... some while later another collegue tried again and we ran into similar problems</p> <p>Yet some people claim it to be a great product.. I'm not a fan yet ...maybe one day they they'll know to convince me .. but for now<br /> When I need Nagios .. I`ll be looking at <a href="http://djagios.org/" rel="nofollow">Djagios</a> , <a href="http://opsview.org/" rel="nofollow">Opsview</a> , <a href="http://www.icinga.org/" rel="nofollow">Icinga</a> or the real thing.</p> <p>However, ther's still Hyperic, Zenoss and Zabbix :)</p> http://127.0.0.1:8080/blog/monitoringforge#comments hyperic icinga look no link monitoring nagios zabbix zenos Wed, 23 Sep 2009 19:20:57 +0000 Kris Buytaert 939 at http://127.0.0.1:8080/blog