Architecture or Hardware Specs for a Chef Server with around 15k nodes

okram999 · April 11, 2016, 7:58pm

I have been mostly working on the chef-client side, so far. Please throw in some tips, to get my research started. I am looking to set up a chef server with around 15k nodes.
Are there different set up or architectures of setting up the chef server?
What is the recommended hardware spec for this kind of server. I think the frequency of chef-client runs on each nodes will play a major role too in the server’s harsware spec

nclemons · April 11, 2016, 8:00pm

Off the cuff, ensure you have at least 4G of RAM. You may need more for that many nodes, I haven’t run at that scale, but definitely don’t skimp on the RAM side of the equation.

Nathan Clemons

DevOps Engineer

Moxie Cloud Services (MCS)

O +1.425.467.5075

M +1.360.861.6291

E nclemons@gomoxie.com

W www.gomoxie.com http://www.gomoxie.com/

Galen_Emery1 · April 11, 2016, 8:33pm

We actually have a pretty good doc on scaling the chef server. (1)

As you mentioned, the scale concern isn’t so much the nodes but the
frequency of check ins. In addition to the frequency, use of search,
storing additional attributes in the node object, or storing fewer
attributes all have an effect on the size you’ll need.

One of our engineers, Irving has a great blog post on scaling the chef
server(2), where he goes into some of the details.

The tl;dr is a chef server at 15k nodes needs to be pretty beefy and you
might want to consider setting up high availability and/or replication to
break that up.

https://docs.chef.io/server_components.html

http://irvingpop.github.io/blog/2015/04/20/tuning-the-chef-server-for-scale/

–Mobile Galen

okram999 · April 12, 2016, 5:45am

What is a node object being referenced below?

The default maximum allowable size for a node object is 1MB, although it
_ is rare for nodes to exceed 150KB. Though compressed, this data is _
_replicated twice, once in Apache Solr, and once in PostgreSQL. In _
_practice, allowing a conservative 2MB of storage on the disk partition _
per node should be sufficient

kallistec · April 12, 2016, 3:24pm

It's the ruby object that you reference as node in your cookbooks. The bulk of its data is from ohai and your default/normal/override attributes. It gets sent to the server in JSON form, which is what I would guess those measurements are based on.

okram999 · April 13, 2016, 10:38pm

This is not going to be on AWS but inside a data center. And i think the HA comes out of box for aws.

I am looking at the tired set-up. With the tired setup how do i install to have multiple backend servers? Will the backend servers replicate one another?

shortdudey123 · April 14, 2016, 12:31am

I run 2 frontends and 3 backends in my setup. I have both frontends
pointed to 1 backend. Each backend runs GlusterFS and mounts it to the
data directory (/var/opt/opscode/) for chef. Fair warning though, the chef
docs (https://docs.chef.io/chef_system_requirements.html) say “The Chef
server MUST NOT use a network file system of any type”, however, i have not
had any issues with several thousand nodes.

-Grant

Topic		Replies	Views
What kind of load can Chef Server support? Chef Infra (archive)	4	348	January 28, 2012
Right sizing Chef11 Server Chef Infra (archive)	6	340	September 3, 2013
AWS HA - instance sizing Chef Infra (archive)	2	282	June 14, 2015
Controling Memory usage Chef Infra (archive)	3	389	January 29, 2013
Performance and scalability of a chef server Chef Infra (archive)	4	521	August 9, 2011

Architecture or Hardware Specs for a Chef Server with around 15k nodes

Related topics