Introducing ‘Design Patterns for Monitoring’ - Monitoring Email
By Vladimir Vuksan ~ December 4th, 2009. Filed under: Design Patterns for Monitoring, Monitoring Email.
In a recent MonitoringForge advisory board e-mail thread Tara Spalding initiated
a discussion on ways to spark content contributions to MonitoringForge, specifically Wiki documentation. She was at LISA ‘09 and discussed with David Nalley of Fedora project what would be most valuable initial contributions. To cut the long story short the opinion coalesced over the idea to organize the Wiki in form of monitoring design recipes for individual services e.g. email monitoring design, web monitoring design etc. These design documents would cover both types of monitoring ie.
- Service availability/validation ie. e-mail service up and down, etc.
- Performance monitoring/trending
Within those individual areas we would start with simple/easy approaches and work our way up ie. most people will want to know how to monitor whether their e-mail server is up (usually easy to do) however may not need or want to monitor SMTP authentication. David Nalley did a great job of outlining a sample mail monitoring design recipe as follows:
*Mail
**SMTP
***Availability Monitoring
****Port 25 open
****Port 25 providing expected response
****MTA accepting mail
***Performance Monitoring
****Messages Received/time unit
****Messages Sent/time unit
****4xx errors /time unit
****5xx errors/time unit
****Length of time from successful send till message received by MDA
**IMAP
***Availability Monitoring
****blah blah
***Performance Monitoring
**** blah blah
Where things get tricky is that there are a number of monitoring tools so the question is how do we deal with describing individual implementations. Do we first describe concept in abstract then address individual monitoring tools implementations or is there are better way ?
We would like to get consensus whether the approach outlined is a good approach and are seeking input on how to make this process more effective and more useful.
|
|
|
|
|
|
