Statistics
| Branch: | Tag: | Revision:

root / UPGRADE @ 82b948e4

History | View | Annotate | Download (10.7 kB)

1
Upgrade notes
2
=============
3

    
4
.. highlight:: shell-example
5

    
6
This document details the steps needed to upgrade a cluster to newer versions
7
of Ganeti.
8

    
9
As a general rule the node daemons need to be restarted after each software
10
upgrade; if using the provided example init.d script, this means running the
11
following command on all nodes::
12

    
13
    $ /etc/init.d/ganeti restart
14

    
15

    
16
2.1 and above
17
-------------
18

    
19
Starting with Ganeti 2.0, upgrades between revisions (e.g. 2.1.0 to 2.1.1)
20
should not need manual intervention. As a safety measure, minor releases (e.g.
21
2.1.3 to 2.2.0) require the ``cfgupgrade`` command for changing the
22
configuration version. Below you find the steps necessary to upgrade between
23
minor releases.
24

    
25
To run commands on all nodes, the `distributed shell (dsh)
26
<http://www.netfort.gr.jp/~dancer/software/dsh.html.en>`_ can be used, e.g.
27
``dsh -M -F 8 -f /var/lib/ganeti/ssconf_online_nodes gnt-cluster --version``.
28

    
29
#. Ensure no jobs are running (master node only)::
30

    
31
    $ gnt-job list
32

    
33
#. Pause the watcher for an hour (master node only)::
34

    
35
    $ gnt-cluster watcher pause 1h
36

    
37
#. Stop all daemons on all nodes::
38

    
39
    $ /etc/init.d/ganeti stop
40

    
41
#. Backup old configuration (master node only)::
42

    
43
    $ tar czf /var/lib/ganeti-$(date +\%FT\%T).tar.gz -C /var/lib ganeti
44

    
45
#. Install new Ganeti version on all nodes
46
#. Run cfgupgrade on the master node::
47

    
48
    $ /usr/lib/ganeti/tools/cfgupgrade --verbose --dry-run
49
    $ /usr/lib/ganeti/tools/cfgupgrade --verbose
50

    
51
   (``cfgupgrade`` supports a number of parameters, run it with
52
   ``--help`` for more information)
53

    
54
#. Upgrade the directory permissions on all nodes::
55

    
56
    $ /usr/lib/ganeti/ensure-dirs --full-run
57

    
58
#. Create the (missing) required users and make users part of the required
59
   groups on all nodes::
60

    
61
    $ /usr/lib/ganeti/tools/users-setup
62

    
63
   This will ask for confirmation. To execute directly, add the ``--yes-do-it``
64
   option.
65

    
66
#. Restart daemons on all nodes::
67

    
68
    $ /etc/init.d/ganeti restart
69

    
70
#. Re-distribute configuration (master node only)::
71

    
72
    $ gnt-cluster redist-conf
73

    
74
#. If you use file storage, check that the ``/etc/ganeti/file-storage-paths``
75
   is correct on all nodes. For security reasons it's not copied
76
   automatically, but it can be copied manually via::
77

    
78
   $ gnt-cluster copyfile /etc/ganeti/file-storage-paths
79

    
80
#. Restart daemons again on all nodes::
81

    
82
    $ /etc/init.d/ganeti restart
83

    
84
#. Enable the watcher again (master node only)::
85

    
86
    $ gnt-cluster watcher continue
87

    
88
#. Verify cluster (master node only)::
89

    
90
    $ gnt-cluster verify
91

    
92
Reverting an upgrade
93
~~~~~~~~~~~~~~~~~~~~
94

    
95
For going back between revisions (e.g. 2.1.1 to 2.1.0) no manual
96
intervention is required, as for upgrades.
97

    
98
Starting from version 2.8, ``cfgupgrade`` supports ``--downgrade``
99
option to bring the configuration back to the previous stable version.
100
This is useful if you upgrade Ganeti and after some time you run into
101
problems with the new version. You can downgrade the configuration
102
without losing the changes made since the upgrade. Any feature not
103
supported by the old version will be removed from the configuration, of
104
course, but you get a warning about it. If there is any new feature and
105
you haven't changed from its default value, you don't have to worry
106
about it, as it will get the same value whenever you'll upgrade again.
107

    
108
The procedure is similar to upgrading, but please notice that you have to
109
revert the configuration **before** installing the old version.
110

    
111
#. Ensure no jobs are running (master node only)::
112

    
113
    $ gnt-job list
114

    
115
#. Pause the watcher for an hour (master node only)::
116

    
117
    $ gnt-cluster watcher pause 1h
118

    
119
#. Stop all daemons on all nodes::
120

    
121
    $ /etc/init.d/ganeti stop
122

    
123
#. Backup old configuration (master node only)::
124

    
125
    $ tar czf /var/lib/ganeti-$(date +\%FT\%T).tar.gz -C /var/lib ganeti
126

    
127
#. Run cfgupgrade on the master node::
128

    
129
    $ /usr/lib/ganeti/tools/cfgupgrade --verbose --downgrade --dry-run
130
    $ /usr/lib/ganeti/tools/cfgupgrade --verbose --downgrade
131

    
132
   You may want to copy all the messages about features that have been
133
   removed during the downgrade, in case you want to restore them when
134
   upgrading again.
135

    
136
#. Install the old Ganeti version on all nodes
137

    
138
   NB: in Ganeti 2.8, the ``cmdlib.py`` file was split into a series of files
139
   contained in the ``cmdlib`` directory. If Ganeti is installed from sources
140
   and not from a package, while downgrading Ganeti to a pre-2.8
141
   version it is important to remember to remove the ``cmdlib`` directory
142
   from the directory containing the Ganeti python files (which usually is
143
   ``${PREFIX}/lib/python${VERSION}/dist-packages/ganeti``).
144
   A simpler upgrade/downgrade procedure will be made available in future
145
   versions of Ganeti.
146

    
147
#. Restart daemons on all nodes::
148

    
149
    $ /etc/init.d/ganeti restart
150

    
151
#. Re-distribute configuration (master node only)::
152

    
153
    $ gnt-cluster redist-conf
154

    
155
#. Restart daemons again on all nodes::
156

    
157
    $ /etc/init.d/ganeti restart
158

    
159
#. Enable the watcher again (master node only)::
160

    
161
    $ gnt-cluster watcher continue
162

    
163
#. Verify cluster (master node only)::
164

    
165
    $ gnt-cluster verify
166

    
167

    
168
2.0 releases
169
------------
170

    
171
2.0.3 to 2.0.4
172
~~~~~~~~~~~~~~
173

    
174
No changes needed except restarting the daemon; but rollback to 2.0.3 might
175
require configuration editing.
176

    
177
If you're using Xen-HVM instances, please double-check the network
178
configuration (``nic_type`` parameter) as the defaults might have changed:
179
2.0.4 adds any missing configuration items and depending on the version of the
180
software the cluster has been installed with, some new keys might have been
181
added.
182

    
183
2.0.1 to 2.0.2/2.0.3
184
~~~~~~~~~~~~~~~~~~~~
185

    
186
Between 2.0.1 and 2.0.2 there have been some changes in the handling of block
187
devices, which can cause some issues. 2.0.3 was then released which adds two
188
new options/commands to fix this issue.
189

    
190
If you use DRBD-type instances and see problems in instance start or
191
activate-disks with messages from DRBD about "lower device too small" or
192
similar, it is recoomended to:
193

    
194
#. Run ``gnt-instance activate-disks --ignore-size $instance`` for each
195
   of the affected instances
196
#. Then run ``gnt-cluster repair-disk-sizes`` which will check that
197
   instances have the correct disk sizes
198

    
199
1.2 to 2.0
200
----------
201

    
202
Prerequisites:
203

    
204
- Ganeti 1.2.7 is currently installed
205
- All instances have been migrated from DRBD 0.7 to DRBD 8.x (i.e. no
206
  ``remote_raid1`` disk template)
207
- Upgrade to Ganeti 2.0.0~rc2 or later (~rc1 and earlier don't have the needed
208
  upgrade tool)
209

    
210
In the below steps, replace :file:`/var/lib` with ``$libdir`` if Ganeti was not
211
installed with this prefix (e.g. :file:`/usr/local/var`). Same for
212
:file:`/usr/lib`.
213

    
214
Execution (all steps are required in the order given):
215

    
216
#. Make a backup of the current configuration, for safety::
217

    
218
    $ cp -a /var/lib/ganeti /var/lib/ganeti-1.2.backup
219

    
220
#. Stop all instances::
221

    
222
    $ gnt-instance stop --all
223

    
224
#. Make sure no DRBD device are in use, the following command should show no
225
   active minors::
226

    
227
    $ gnt-cluster command grep cs: /proc/drbd | grep -v cs:Unconf
228

    
229
#. Stop the node daemons and rapi daemon on all nodes (note: should be logged
230
   in not via the cluster name, but the master node name, as the command below
231
   will remove the cluster ip from the master node)::
232

    
233
    $ gnt-cluster command /etc/init.d/ganeti stop
234

    
235
#. Install the new software on all nodes, either from packaging (if available)
236
   or from sources; the master daemon will not start but give error messages
237
   about wrong configuration file, which is normal
238
#. Upgrade the configuration file::
239

    
240
    $ /usr/lib/ganeti/tools/cfgupgrade12 -v --dry-run
241
    $ /usr/lib/ganeti/tools/cfgupgrade12 -v
242

    
243
#. Make sure ``ganeti-noded`` is running on all nodes (and start it if
244
   not)
245
#. Start the master daemon::
246

    
247
    $ ganeti-masterd
248

    
249
#. Check that a simple node-list works::
250

    
251
    $ gnt-node list
252

    
253
#. Redistribute updated configuration to all nodes::
254

    
255
    $ gnt-cluster redist-conf
256
    $ gnt-cluster copyfile /var/lib/ganeti/known_hosts
257

    
258
#. Optional: if needed, install RAPI-specific certificates under
259
   :file:`/var/lib/ganeti/rapi.pem` and run::
260

    
261
    $ gnt-cluster copyfile /var/lib/ganeti/rapi.pem
262

    
263
#. Run a cluster verify, this should show no problems::
264

    
265
    $ gnt-cluster verify
266

    
267
#. Remove some obsolete files::
268

    
269
    $ gnt-cluster command rm /var/lib/ganeti/ssconf_node_pass
270
    $ gnt-cluster command rm /var/lib/ganeti/ssconf_hypervisor
271

    
272
#. Update the xen pvm (if this was a pvm cluster) setting for 1.2
273
   compatibility::
274

    
275
    $ gnt-cluster modify -H xen-pvm:root_path=/dev/sda
276

    
277
#. Depending on your setup, you might also want to reset the initrd parameter::
278

    
279
    $ gnt-cluster modify -H xen-pvm:initrd_path=/boot/initrd-2.6-xenU
280

    
281
#. Reset the instance autobalance setting to default::
282

    
283
    $ for i in $(gnt-instance list -o name --no-headers); do \
284
       gnt-instance modify -B auto_balance=default $i; \
285
      done
286

    
287
#. Optional: start the RAPI demon::
288

    
289
    $ ganeti-rapi
290

    
291
#. Restart instances::
292

    
293
    $ gnt-instance start --force-multiple --all
294

    
295
At this point, ``gnt-cluster verify`` should show no errors and the migration
296
is complete.
297

    
298
1.2 releases
299
------------
300

    
301
1.2.4 to any other higher 1.2 version
302
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
303

    
304
No changes needed. Rollback will usually require manual edit of the
305
configuration file.
306

    
307
1.2.3 to 1.2.4
308
~~~~~~~~~~~~~~
309

    
310
No changes needed. Note that going back from 1.2.4 to 1.2.3 will require manual
311
edit of the configuration file (since we added some HVM-related new
312
attributes).
313

    
314
1.2.2 to 1.2.3
315
~~~~~~~~~~~~~~
316

    
317
No changes needed. Note that the drbd7-to-8 upgrade tool does a disk format
318
change for the DRBD metadata, so in theory this might be **risky**. It is
319
advised to have (good) backups before doing the upgrade.
320

    
321
1.2.1 to 1.2.2
322
~~~~~~~~~~~~~~
323

    
324
No changes needed.
325

    
326
1.2.0 to 1.2.1
327
~~~~~~~~~~~~~~
328

    
329
No changes needed. Only some bugfixes and new additions that don't affect
330
existing clusters.
331

    
332
1.2.0 beta 3 to 1.2.0
333
~~~~~~~~~~~~~~~~~~~~~
334

    
335
No changes needed.
336

    
337
1.2.0 beta 2 to beta 3
338
~~~~~~~~~~~~~~~~~~~~~~
339

    
340
No changes needed. A new version of the debian-etch-instance OS (0.3) has been
341
released, but upgrading it is not required.
342

    
343
1.2.0 beta 1 to beta 2
344
~~~~~~~~~~~~~~~~~~~~~~
345

    
346
Beta 2 switched the config file format to JSON. Steps to upgrade:
347

    
348
#. Stop the daemons (``/etc/init.d/ganeti stop``) on all nodes
349
#. Disable the cron job (default is :file:`/etc/cron.d/ganeti`)
350
#. Install the new version
351
#. Make a backup copy of the config file
352
#. Upgrade the config file using the following command::
353

    
354
    $ /usr/share/ganeti/cfgupgrade --verbose /var/lib/ganeti/config.data
355

    
356
#. Start the daemons and run ``gnt-cluster info``, ``gnt-node list`` and
357
   ``gnt-instance list`` to check if the upgrade process finished successfully
358

    
359
The OS definition also need to be upgraded. There is a new version of the
360
debian-etch-instance OS (0.2) that goes along with beta 2.
361

    
362
.. vim: set textwidth=72 :
363
.. Local Variables:
364
.. mode: rst
365
.. fill-column: 72
366
.. End: