Best Practices
==============

Here are some tips for making the most of Ansible playbooks.

You can find some example playbooks illustrating these best practices in our `ansible-examples repository <https://github.com/ansible/ansible-examples>`_. (NOTE: These may not use all of the features in the latest release, but are still an excellent reference!)

.. _content_organization:

Content Organization
++++++++++++++++++++++

The following section shows one of many possible ways to organize playbook content. Your usage of Ansible should fit your needs, however, not ours, so feel free to modify this approach and organize as you see fit.

(One thing you will definitely want to do, though, is use the "roles" organization feature, which is documented as part
of the main playbooks page. See :doc:`playbooks_roles`.)

.. _directory_layout:

Directory Layout
````````````````

The top level of the directory would contain files and directories like so::

    production                # inventory file for production servers
    stage                     # inventory file for stage environment

    group_vars/
       group1                 # here we assign variables to particular groups
       group2                 # ""
    host_vars/
       hostname1              # if systems need specific variables, put them here
       hostname2              # ""

    site.yml                  # master playbook
    webservers.yml            # playbook for webserver tier
    dbservers.yml             # playbook for dbserver tier

    roles/
        common/               # this hierarchy represents a "role"
            tasks/            #
                main.yml      #  <-- tasks file can include smaller files if warranted
            handlers/         #
                main.yml      #  <-- handlers file
            templates/        #  <-- files for use with the template resource
                ntp.conf.j2   #  <------- templates end in .j2
            files/            #
                bar.txt       #  <-- files for use with the copy resource
                foo.sh        #  <-- script files for use with the script resource
            vars/             #
                main.yml      #  <-- variables associated with this role

        webtier/              # same kind of structure as "common" was above, done for the webtier role
        monitoring/           # ""
        fooapp/               # ""

.. _stage_vs_prod:

How to Arrange Inventory, Stage vs Production
`````````````````````````````````````````````

In the example below, the *production* file contains the inventory of all of your production hosts. Of course, you can pull inventory from an external data source as well, but this is just a basic example.

It is suggested that you define groups based on the purpose of the host (roles) and also geography or datacenter location (if applicable)::

    # file: production

    [atlanta-webservers]
    www-atl-1.example.com
    www-atl-2.example.com

    [boston-webservers]
    www-bos-1.example.com
    www-bos-2.example.com

    [atlanta-dbservers]
    db-atl-1.example.com
    db-atl-2.example.com

    [boston-dbservers]
    db-bos-1.example.com

    # webservers in all geos
    [webservers:children]
    atlanta-webservers
    boston-webservers

    # dbservers in all geos
    [dbservers:children]
    atlanta-dbservers
    boston-dbservers

    # everything in the atlanta geo
    [atlanta:children]
    atlanta-webservers
    atlanta-dbservers

    # everything in the boston geo
    [boston:children]
    boston-webservers
    boston-dbservers

.. _groups_and_hosts:

Group And Host Variables
````````````````````````

Now, groups are nice for organization, but that's not all groups are good for. You can also assign variables to them! For instance, atlanta has its own NTP servers, so when setting up ntp.conf, we should use them. Let's set those now::

    ---
    # file: group_vars/atlanta
    ntp: ntp-atlanta.example.com
    backup: backup-atlanta.example.com

Variables aren't just for geographic information either! Maybe the webservers have some configuration that doesn't make sense for the database servers::

    ---
    # file: group_vars/webservers
    apacheMaxRequestsPerChild: 3000
    apacheMaxClients: 900

If we had any default values, or values that were universally true, we would put them in a file called group_vars/all::

    ---
    # file: group_vars/all
    ntp: ntp-boston.example.com
    backup: backup-boston.example.com

We can define specific hardware variance in systems in a host_vars file, but avoid doing this unless you need to::

    ---
    # file: host_vars/db-bos-1.example.com
    foo_agent_port: 86
    bar_agent_port: 99

.. _split_by_role:

Top Level Playbooks Are Separated By Role
`````````````````````````````````````````

In site.yml, we include the playbooks that together define our entire infrastructure. Note this is SUPER short, because it's just including
some other playbooks. Remember, playbooks are nothing more than lists of plays::

    ---
    # file: site.yml
    - include: webservers.yml
    - include: dbservers.yml

In a file like webservers.yml (also at the top level), we simply map the configuration of the webservers group to the roles performed by the webservers group. Also notice this is incredibly short. For example::

    ---
    # file: webservers.yml
    - hosts: webservers
      roles:
        - common
        - webtier

.. _role_organization:

Task And Handler Organization For A Role
````````````````````````````````````````

Below is an example tasks file that explains how a role works. Our common role here just sets up NTP, but it could do more if we wanted::

    ---
    # file: roles/common/tasks/main.yml

    - name: be sure ntp is installed
      yum: pkg=ntp state=installed
      tags: ntp

    - name: be sure ntp is configured
      template: src=ntp.conf.j2 dest=/etc/ntp.conf
      notify:
        - restart ntpd
      tags: ntp

    - name: be sure ntpd is running and enabled
      service: name=ntpd state=started enabled=yes
      tags: ntp

Here is an example handlers file. As a review, handlers are only fired when certain tasks report changes, and are run at the end
of each play::

    ---
    # file: roles/common/handlers/main.yml
    - name: restart ntpd
      service: name=ntpd state=restarted

See :doc:`playbooks_roles` for more information.

.. _organization_examples:

What This Organization Enables (Examples)
`````````````````````````````````````````

Above we've shared our basic organizational structure.

Now what sort of use cases does this layout enable? Lots! If I want to reconfigure my whole infrastructure, it's just::

    ansible-playbook -i production site.yml

What about just reconfiguring NTP on everything? Easy::

    ansible-playbook -i production site.yml --tags ntp

What about just reconfiguring my webservers?::

    ansible-playbook -i production webservers.yml

What about just my webservers in Boston?::

    ansible-playbook -i production webservers.yml --limit boston

What about just the first 10, and then the next 10?::

    ansible-playbook -i production webservers.yml --limit boston[0-10]
    ansible-playbook -i production webservers.yml --limit boston[10-20]

And of course just basic ad-hoc stuff is also possible. Note that the ad-hoc command needs a host pattern (here, ``all``)::

    ansible all -i production -m ping
    ansible all -i production -m command -a '/sbin/reboot' --limit boston

And there are some useful commands to know (at least in 1.1 and higher)::

    # confirm what task names would be run if I ran this command and said "just ntp tasks"
    ansible-playbook -i production webservers.yml --tags ntp --list-tasks

    # confirm what hostnames might be communicated with if I said "limit to boston"
    ansible-playbook -i production webservers.yml --limit boston --list-hosts

.. _dep_vs_config:

Deployment vs Configuration Organization
````````````````````````````````````````

The above setup models a typical configuration topology. When doing multi-tier deployments, there are going
to be some additional playbooks that hop between tiers to roll out an application. In this case, 'site.yml'
may be augmented by playbooks like 'deploy_exampledotcom.yml', but the general concepts can still apply.

Consider "playbooks" as a sports metaphor -- you don't have to just have one set of plays to use against your infrastructure
all the time -- you can have situational plays that you use at different times and for different purposes.

Ansible allows you to deploy and configure using the same tool, so you would likely reuse groups and just
keep the OS configuration in separate playbooks from the app deployment.
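
As a rough sketch of what such a deployment playbook might look like, a single file can hold one play per tier. The role names ``exampledotcom_schema`` and ``exampledotcom_app`` are hypothetical placeholders and are not part of the layout shown earlier::

    ---
    # file: deploy_exampledotcom.yml
    # hypothetical example: the role names below are placeholders

    # first update the database tier...
    - hosts: dbservers
      roles:
        - exampledotcom_schema

    # ...then roll the application out to the web tier
    - hosts: webservers
      roles:
        - exampledotcom_app

Because it reuses the same inventory groups as site.yml, a playbook like this could be run against either inventory file with -i.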

.. _stage_vs_production:

Stage vs Production
+++++++++++++++++++

As also mentioned above, a good way to keep your stage (or testing) and production environments separate is to use a separate inventory file for stage and production. This way you pick with -i what you are targeting. Keeping them all in one file can lead to surprises!

Testing things in a stage environment before trying in production is always a great idea. Your environments need not be the same
size and you can use group variables to control the differences between those environments.

.. _rolling_update:

Rolling Updates
+++++++++++++++

Understand the 'serial' keyword. If you are updating a webserver farm, you really want to use it to control how many machines you are
updating at once in the batch.

See :doc:`playbooks_delegation`.
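
As a minimal sketch, a play that updates the webservers a few hosts at a time might look like this. The batch size of 5 is an arbitrary value chosen for illustration::

    ---
    # file: webservers.yml
    - hosts: webservers
      serial: 5          # work through the group five hosts at a time
      roles:
        - common
        - webtier

With 'serial' set, Ansible completes the whole play on one batch of hosts before moving on to the next batch.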

.. _mention_the_state:

Always Mention The State
++++++++++++++++++++++++

The 'state' parameter is optional to a lot of modules. Whether 'state=present' or 'state=absent', it's always best to leave that
parameter in your playbooks to make it clear, especially as some modules support additional states.
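
For example, the yum module installs a package when no state is given, so the two tasks below should behave the same way, but the second one spells out the intent (a sketch in the same style as the ntp tasks in the role example)::

    # works, but relies on the module's default state
    - name: be sure ntp is installed
      yum: pkg=ntp

    # clearer: the desired state is stated explicitly
    - name: be sure ntp is installed
      yum: pkg=ntp state=present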

.. _group_by_roles:

Group By Roles
++++++++++++++

A system can be in multiple groups. See :doc:`intro_inventory` and :doc:`intro_patterns`. Having groups named after things like
*webservers* and *dbservers* is repeated in the examples because it's a very powerful concept.

This allows playbooks to target machines based on role, as well as to assign role-specific variables
using the group variable system.

See :doc:`playbooks_roles`.

.. _os_variance:

Operating System and Distribution Variance
++++++++++++++++++++++++++++++++++++++++++

When dealing with a parameter that differs between operating systems, the best way to handle this is
by using the group_by module.

This makes a dynamic group of hosts matching certain criteria, even if that group is not defined in the inventory file::

    ---

    # talk to all hosts just so we can learn about them
    - hosts: all
      tasks:
         - group_by: key={{ ansible_distribution }}

    # now just on the CentOS hosts...
    - hosts: CentOS
      gather_facts: False
      tasks:
         - # tasks that only happen on CentOS go here

If group-specific settings are needed, this can also be done. For example::

    ---
    # file: group_vars/all
    asdf: 10

    ---
    # file: group_vars/CentOS
    asdf: 42

In the above example, CentOS machines get the value of '42' for asdf, but other machines get '10'.

.. _ship_modules_with_playbooks:

Bundling Ansible Modules With Playbooks
+++++++++++++++++++++++++++++++++++++++

.. versionadded:: 0.5

If a playbook has a "./library" directory relative to its YAML file, this directory can be used to add Ansible modules that will
automatically be in the Ansible module path. This is a great way to keep modules that go with a playbook together.

.. _whitespace:

Whitespace and Comments
+++++++++++++++++++++++

Generous use of whitespace to break things up, and use of comments (which start with '#'), is encouraged.

.. _name_tasks:

Always Name Tasks
+++++++++++++++++

It is possible to leave off the 'name' for a given task, though it is recommended to provide one that describes
why something is being done. This name is shown when the playbook is run.
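
For illustration, here is the same kind of task written without and with a name, using the ntpd service from the role example::

    # no name: the run output has no description of intent for this task
    - service: name=ntpd state=started enabled=yes

    # named: the description below is what appears when the playbook runs
    - name: be sure ntpd is running and enabled
      service: name=ntpd state=started enabled=yes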

.. _keep_it_simple:

Keep It Simple
++++++++++++++

When you can do something simply, do it simply. Do not reach
to use every feature of Ansible together, all at once. Use what works
for you. For example, you will probably not need 'vars',
'vars_files', 'vars_prompt' and '--extra-vars' all at once,
while also using an external inventory file.

.. _version_control:

Version Control
+++++++++++++++

Use version control. Keep your playbooks and inventory file in git
(or another version control system), and commit when you make changes
to them. This way you have an audit trail describing when and why you
changed the rules that are automating your infrastructure.

.. seealso::

   :doc:`YAMLSyntax`
       Learn about YAML syntax
   :doc:`playbooks`
       Review the basic playbook features
   :doc:`modules`
       Learn about available modules
   :doc:`developing_modules`
       Learn how to extend Ansible by writing your own modules
   :doc:`intro_patterns`
       Learn about how to select hosts
   `GitHub examples directory <https://github.com/ansible/ansible/tree/devel/examples/playbooks>`_
       Complete playbook files from the GitHub project source
   `Mailing List <http://groups.google.com/group/ansible-project>`_
       Questions? Help? Ideas? Stop by the list on Google Groups