Jump to content

Server Monitoring


Recommended Posts

The TotalChoice Web Hosting Operations Center is located in Burton, Michigan and is staffed twenty-four hours per day, seven days a week.

The TotalChoice Web Hosting Operations Center provides round-the-clock monitoring of all hardware, including routers, switches, UPS systems, and servers. In brief, this comprises; Monitoring of all critical services and ports, including FTP, HTTP, SMTP, HTTPS, SSH, TELNET, CPANEL and POP3.

 

Routine network monitoring of the Local Area Network, Internet connectivity (all routers, switches, and wiring), and the Internet backbone via MRTG and our custom tailored software. Close and constant communication with our data center providers to ensure that our customers have constant access to high-performance Internet connections 24 hours per day, seven days per week. An instant paging system to alert our staff to a failure on any monitoring service. This is supplemented by audible alarms and LAN messages.

 

TotalChoice Web Hosting Server Monitoring

Link to comment
Share on other sites

  • 3 years later...

We have added the following monitoring on each TCH server:

 

CPU Load

MySQL Processes

Age of Local Backups (insures we have fresh local backups)

Mail Queue Size Warnings

SSL

Shared SSL Expiration

Webmail

DNS

Hard Disk Space

MailMan & Qrunner

Servertime

SSH Port

Link to comment
Share on other sites

Things just keep getting better and better around here.

 

Vivek has created a monitoring plugin to monitor the smartd daemon on all servers.

 

We are still working out the bugs for the monitoring protocol, however it looks very very promising.

 

This should be fully implemented by end of this week.

 

 

You may asak what smartd is and I offer this

 

"S.M.A.R.T. technology

 

 

S.M.A.R.T. stands for Self-Monitoring Analysis and Reporting Technology. S.M.A.R.T. technology was developed by a number of major hard disk drive manufacturers in a concerted effort to increase the reliability of drives. It is a technology that enables the PC to predict the future failure of hard disk drives. S.M.A.R.T. technology has become an industry standard for hard drive manufacturers.

 

Through the S.M.A.R.T. system, modern hard disk drives incorporate a suite of advanced diagnostics that monitor the internal operations of a drive and provide an early warning for many types of potential problems. When a potential problem is detected, the drive can be repaired or replaced before any data is lost or damaged.

 

The S.M.A.R.T. system monitors the drive for anything that might seem out of the ordinary, documents it, and analyzes the data. If it sees something that indicates a problem, it is capable of notifying the user (or system administrator). S.M.A.R.T. monitors disk performance, faulty sectors, recalibration, CRC errors, drive spin-up time, drive heads, distance between the heads and the disk platters, drive temperature, and characteristics of the media, motor and servomechanisms. The errors the system can detect can be predicted by a number of methods. Currently the SMART system can detect about 70% of all hard drive errors.

 

Here's an example: motor and/or bearing failure can be predicted by an increase in the drive spin-up time and the number of retries it takes to get the drive spinning at full speed. Or, if the drive notes that error correction is being needed excessively, it can attribute this to a broken drive head or surface contamination, and it will create an alert before the problem gets worse. Armed with a prediction of failure, the user or system administrator can make a backup copy of key data, replace a suspect device prior to data loss, and avoid undesired downtime."

Link to comment
Share on other sites

×
×
  • Create New...