We use both email handler & Datadog handler. We were hit by a situation where there was a memory leak (with chef-client in daemon mode) which caused the handler also to fail without enough memory. We ended up fixing the memory leak and changed chef-client execution to task instead of service.
From: Mike [mailto:firstname.lastname@example.org]
Sent: Wednesday, September 17, 2014 4:10 AM
Subject: [chef] Re: Re: Best practice for measuring and monitoring chef-client runs?
On Sep 16, 2014 8:39 PM, “Jeff Byrnes” <email@example.com:firstname.lastname@example.org> wrote:
We use an email handler to report runs; primarily filtered for failed runs. Crude, but it works.
On Tue, Sep 16, 2014 at 8:33 PM, Augie Schwer <email@example.com:firstname.lastname@example.org> wrote:
What are people using to monitor and measure their chef-client runs?
I would like to monitor for when chef-client runs fail on a node.
It would be nice to measure chef-client run times.
Is it safe to assume people are using handlers for both of these? What are some popular ways to accomplish these goals? Thanks!
Augie Schwer - Augie@Schwer.usmailto:Augie@Schwer.us - http://schwer.us
Click herehttps://www.mailcontrol.com/sr/MZbqvYs5QwJvpeaetUwhCQ== to report this email as spam.
SDL PLC confidential, all rights reserved.
If you are not the intended recipient of this mail SDL requests and requires that you delete it without acting upon or copying any of its contents,
and we further request that you advise us.
SDL PLC is a public limited company registered in England and Wales.
Registered number: 02675207.
Registered address: Globe House, Clivemont Road, Maidenhead, Berkshire SL6 7DY, UK.
This message has been scanned for malware by Websense. www.websense.com