{"id":2263,"date":"2018-11-02T02:31:42","date_gmt":"2018-11-02T02:31:42","guid":{"rendered":"https:\/\/www.appservgrid.com\/paw92\/?p=2263"},"modified":"2018-11-02T09:45:26","modified_gmt":"2018-11-02T09:45:26","slug":"the-monitoring-issue-linux-journal","status":"publish","type":"post","link":"https:\/\/www.appservgrid.com\/paw92\/index.php\/2018\/11\/02\/the-monitoring-issue-linux-journal\/","title":{"rendered":"The Monitoring Issue | Linux Journal"},"content":{"rendered":"<p>In 1935, Austrian physicist, Erwin Schr\u00f6dinger, still flying high after his<br \/>\nNobel Prize win from two years earlier, created a simple thought experiment.<\/p>\n<p>It ran something like this:<\/p>\n<blockquote><p>If you have a file server, you cannot know if that server is up or<br \/>\ndown&#8230;until you check on it. Thus, until you use it, a file server<br \/>\nis\u2014in a sense\u2014both up and down. At the same time.<\/p><\/blockquote>\n<p>This little brain teaser became known as Schr\u00f6dinger&#8217;s File Server, and<br \/>\nit&#8217;s<br \/>\nregarded as the first known critical research on the intersection of Systems<br \/>\nAdministration and Quantum Superposition. (Though, why Erwin chose,<br \/>\nspecifically, to use a &#8220;file server&#8221; as an example remains a bit of a<br \/>\nmystery\u2014as the experiment works equally well with any type of server.<br \/>\nIt&#8217;s like,<br \/>\nwe get it, Erwin. You have a nice NAS. Get over it.)<\/p>\n<p>&#8230; Okay, perhaps it didn&#8217;t go exactly like that. But I&#8217;m confident it would<br \/>\nhave&#8230;you know&#8230;had good old Erwin had a nice Network Attached Storage<br \/>\nserver instead of a cat.<\/p>\n<p>Regardless, the lessons from that experiment certainly hold true for servers.<br \/>\nIf you haven&#8217;t checked on your server recently, how can you be truly sure<br \/>\nit&#8217;s running properly? Heck, it might not even be running at all!<\/p>\n<p>Monitoring a server\u2014to be notified when problems occur or, even better,<br \/>\nwhen problems look like they are about to occur\u2014seems, at first blush, to<br \/>\nbe a simple task. Write a script to ping a server, then email me when the<br \/>\nping times out. Run that script every few minutes and, shazam, we&#8217;ve got a<br \/>\nserver monitoring solution! Easy-peasy, time for lunch!<\/p>\n<p>Whoah, there! Not so fast!<\/p>\n<p>That server monitoring solution right there? It stinks. It&#8217;s fragile. It<br \/>\ngives you very little information (other than the results of a ping). Even<br \/>\nfor administering your own home server, that&#8217;s barely enough information and<br \/>\nmonitoring to keep things running smoothly.<\/p>\n<p>Even if you have a more robust solution in place, odds are there are<br \/>\nsignificant shortcomings and problems with it. Luckily, <em>Linux<br \/>\nJournal<\/em> has<br \/>\nyour back\u2014this issue is chock full of advice, tips and tricks for how to<br \/>\nkeep your servers effectively monitored.<\/p>\n<p>You know, so you&#8217;re not just guessing of the cat is still alive in there.<\/p>\n<p>Mike Julian (author of O&#8217;Reilly&#8217;s <em>Practical Monitoring<\/em>) goes into detail on a<br \/>\nbunch of the ways your monitoring solution needs serious work in his<br \/>\nadorably titled &#8220;Why Your Server Monitoring (Still) Sucks&#8221; article.<\/p>\n<p>We continue &#8220;telling it like it is&#8221; with Corey Quinn&#8217;s treatise on Amazon&#8217;s<br \/>\nCloudWatch, &#8220;CloudWatch Is of the Devil, but I Must Use It&#8221;. Seriously,<br \/>\nCorey, tell us how you really feel.<\/p>\n<p>With our cathartic, venting session behind us, we&#8217;ve got a detailed, hands-on<br \/>\nwalk-through of how to use Monit (an open-source process supervisor for<br \/>\nLinux) coupled with RRDtool (a GPL&#8217;d tool for capturing data over long<br \/>\nperiods of time, such as from shell scripts, and graphing it) to monitor your<br \/>\nserver in a fairly simple, and very open-source, way.<\/p>\n<p>Round that out with an interview with Steve Newman (one of the folks who<br \/>\ncreated Writely, which you might know as Google Docs, following Google&#8217;s<br \/>\nacquisition in 2006) on his company, Scalyr, which handles server monitoring<br \/>\nand log management\u2014and you&#8217;ve got more server monitoring information than you can shake a stick at. Or, you can go back to guessing if the cat is still alive. That&#8217;s fun too.<\/p>\n<p><a href=\"https:\/\/www.linuxjournal.com\/content\/monitoring-issue\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In 1935, Austrian physicist, Erwin Schr\u00f6dinger, still flying high after his Nobel Prize win from two years earlier, created a simple thought experiment. It ran something like this: If you have a file server, you cannot know if that server is up or down&#8230;until you check on it. Thus, until you use it, a file &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/www.appservgrid.com\/paw92\/index.php\/2018\/11\/02\/the-monitoring-issue-linux-journal\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;The Monitoring Issue | Linux Journal&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2263","post","type-post","status-publish","format-standard","hentry","category-linux"],"_links":{"self":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts\/2263","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/comments?post=2263"}],"version-history":[{"count":1,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts\/2263\/revisions"}],"predecessor-version":[{"id":2317,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts\/2263\/revisions\/2317"}],"wp:attachment":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/media?parent=2263"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/categories?post=2263"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/tags?post=2263"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}