gnt-cluster(8) Ganeti | Version @GANETI_VERSION@
================================================

Name
----

gnt-cluster - Ganeti administration, cluster-wide

Synopsis
--------

**gnt-cluster** {command} [arguments...]

DESCRIPTION
-----------

The **gnt-cluster** command is used for cluster-wide administration in
the Ganeti system.

COMMANDS
--------

ADD-TAGS
~~~~~~~~

**add-tags** [--from *file*] {*tag*...}

Add tags to the cluster. If any of the tags contains invalid
characters, the entire operation will abort.

If the ``--from`` option is given, the list of tags will be
extended with the contents of that file (each line becomes a tag).
In this case, there is no need to pass tags on the command line
(if you do, both sources will be used). A file name of - will be
interpreted as stdin.
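
For example, to add two tags directly and more tags from a file, one
could run (the tag names and the file path are illustrative)::

    # gnt-cluster add-tags owner:admin environment:production
    # gnt-cluster add-tags --from /tmp/cluster-tags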

COMMAND
~~~~~~~

**command** [-n *node*] [-g *group*] {*command*}

Executes a command on all nodes. If the option ``-n`` is not given,
the command will be executed on all nodes, otherwise it will be
executed only on the node(s) specified. Use the option multiple
times for running it on multiple nodes, like::

    # gnt-cluster command -n node1.example.com -n node2.example.com date

The ``-g`` option can be used to run a command only on a specific node
group, e.g.::

    # gnt-cluster command -g default date

The command is executed serially on the selected nodes. If the
master node is present in the list, the command will be executed
last on the master. Regarding the other nodes, the execution order
is somewhat alphabetic, so that node2.example.com will be earlier
than node10.example.com but after node1.example.com.

So given the node names node1, node2, node3, node10, node11, with
node3 being the master, the order will be: node1, node2, node10,
node11, node3.

The command is constructed by concatenating all other command line
arguments. For example, to list the contents of the /etc directory
on all nodes, run::

    # gnt-cluster command ls -l /etc

and the command which will be executed will be ``ls -l /etc``.

COPYFILE
~~~~~~~~

| **copyfile** [--use-replication-network] [-n *node*] [-g *group*]
| {*file*}

Copies a file to all or to some nodes. The argument specifies the
source file (on the current system), the ``-n`` argument specifies
the target node, or nodes if the option is given multiple times. If
``-n`` is not given at all, the file will be copied to all nodes. The
``-g`` option can be used to only select nodes in a specific node group.
Passing the ``--use-replication-network`` option will cause the
copy to be done over the replication network (this only matters if the
primary/secondary IPs are different). Example::

    # gnt-cluster copyfile -n node1.example.com -n node2.example.com /tmp/test

This will copy the file /tmp/test from the current node to the two
named nodes.

DESTROY
~~~~~~~

**destroy** {--yes-do-it}

Remove all configuration files related to the cluster, so that a
**gnt-cluster init** can be done again afterwards.

Since this is a dangerous command, you are required to pass the
argument ``--yes-do-it``.

EPO
~~~

**epo** [--on] [--groups|--all] [--power-delay] *arguments*

Performs an emergency power-off on nodes given as arguments. If
``--groups`` is given, arguments are node groups. If ``--all`` is
provided, the whole cluster will be shut down.

The ``--on`` flag recovers the cluster after an emergency power-off.
When powering on the cluster you can use ``--power-delay`` to define the
time in seconds (fractions allowed) waited between powering on
individual nodes.

Please note that the master node will not be powered down or up
automatically. It will just be left in a state where you can manually
perform the shutdown of that one node. If the master is in the list of
affected nodes and this is not a complete cluster emergency power-off
(e.g. using ``--all``), you're required to do a master failover to
another node not affected.
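
For example, a possible sequence to power off the nodes of one node
group and later power them back on with a two-second delay between
nodes (the group name is illustrative) would be::

    # gnt-cluster epo --groups group1
    # gnt-cluster epo --on --power-delay=2 --groups group1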

GETMASTER
~~~~~~~~~

**getmaster**

Displays the current master node.

INFO
~~~~

**info** [--roman]

Shows runtime cluster information: cluster name, architecture (32
or 64 bit), master node, node list and instance list.

Passing the ``--roman`` option, **gnt-cluster info** will try to print
its integer fields in a Latin-friendly way. This allows further
diffusion of Ganeti among ancient cultures.

INIT
~~~~

| **init**
| [{-s|--secondary-ip} *secondary\_ip*]
| [--vg-name *vg-name*]
| [--master-netdev *interface-name*]
| [{-m|--mac-prefix} *mac-prefix*]
| [--no-lvm-storage]
| [--no-etc-hosts]
| [--no-ssh-init]
| [--file-storage-dir *dir*]
| [--enabled-hypervisors *hypervisors*]
| [-t *hypervisor name*]
| [{-H|--hypervisor-parameters} *hypervisor*:*hv-param*=*value*[,*hv-param*=*value*...]]
| [{-B|--backend-parameters} *be-param*=*value* [,*be-param*=*value*...]]
| [{-N|--nic-parameters} *nic-param*=*value* [,*nic-param*=*value*...]]
| [--maintain-node-health {yes \| no}]
| [--uid-pool *user-id pool definition*]
| [{-I|--default-iallocator} *default instance allocator*]
| [--primary-ip-version *version*]
| [--prealloc-wipe-disks {yes \| no}]
| [--node-parameters *ndparams*]
| [{-C|--candidate-pool-size} *candidate\_pool\_size*]
| {*clustername*}

This command is only run once initially on the first node of the
cluster. It will initialize the cluster configuration, set up the
SSH keys, start the daemons on the master node, etc. in order to have
a working one-node cluster.

Note that the *clustername* is not any random name. It has to be
resolvable to an IP address using DNS, and it is best if you give the
fully-qualified domain name. This hostname must resolve to an IP
address reserved exclusively for this purpose, i.e. not already in
use.

The cluster can run in two modes: single-homed or dual-homed. In the
first case, all traffic (public traffic, inter-node traffic and
data replication traffic) goes over the same interface. In the
dual-homed case, the data replication traffic goes over the second
network. The ``-s (--secondary-ip)`` option here marks the cluster as
dual-homed and its parameter represents this node's address on the
second network. If you initialize the cluster with ``-s``, all nodes
added must have a secondary IP as well.

Note that for Ganeti it doesn't matter if the secondary network is
actually a separate physical network, or is done using tunneling,
etc. For performance reasons, it's recommended to use a separate
network, of course.
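
For example, a dual-homed cluster could be initialized from its first
node like this (the cluster name and the secondary IP address are
illustrative)::

    # gnt-cluster init -s 192.0.2.1 cluster.example.com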

The ``--vg-name`` option will let you specify a volume group
different from "xenvg" for Ganeti to use when creating instance
disks. This volume group must have the same name on all nodes. Once
the cluster is initialized this can be altered by using the
**modify** command. If you don't want to use lvm storage at all, use
the ``--no-lvm-storage`` option. Once the cluster is initialized
you can change this setup with the **modify** command.

The ``--master-netdev`` option is useful for specifying a different
interface on which the master will activate its IP address. It's
important that all nodes have this interface because you'll need it
for a master failover.

The ``-m (--mac-prefix)`` option will let you specify a three-byte
prefix under which the virtual MAC addresses of your instances will be
generated. The prefix must be specified in the format ``XX:XX:XX`` and
the default is ``aa:00:00``.

The ``--no-lvm-storage`` option allows you to initialize the
cluster without lvm support. This means that only instances using
files as their storage backend can be created. Once the
cluster is initialized you can change this setup with the
**modify** command.

The ``--no-etc-hosts`` option allows you to initialize the cluster
without modifying the /etc/hosts file.

The ``--no-ssh-init`` option allows you to initialize the cluster
without creating or distributing SSH key pairs.

The ``--file-storage-dir`` option allows you to set the directory to
use for storing the instance disk files when using file storage as
the backend for instance disks.

The ``--prealloc-wipe-disks`` option sets a cluster-wide configuration
value for wiping disks prior to allocation. This increases security
at the instance level, as the instance can't access leftover data from
its underlying storage.

The ``--enabled-hypervisors`` option allows you to set the list of
hypervisors that will be enabled for this cluster. Instance
hypervisors can only be chosen from the list of enabled
hypervisors, and the first entry of this list will be used by
default. Currently, the following hypervisors are available:

xen-pvm
    Xen PVM hypervisor

xen-hvm
    Xen HVM hypervisor

kvm
    Linux KVM hypervisor

chroot
    a simple chroot manager that starts chroot based on a script at the
    root of the filesystem holding the chroot

fake
    fake hypervisor for development/testing

Either a single hypervisor name or a comma-separated list of
hypervisor names can be specified. If this option is not specified,
only the xen-pvm hypervisor is enabled by default.
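
For example, to enable both KVM and Xen PVM, with KVM being the default
(first in the list), one could initialize the cluster like this (the
cluster name is illustrative)::

    # gnt-cluster init --enabled-hypervisors=kvm,xen-pvm cluster.example.com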

The ``-H (--hypervisor-parameters)`` option allows you to set default
hypervisor specific parameters for the cluster. The format of this
option is the name of the hypervisor, followed by a colon and a
comma-separated list of key=value pairs. The keys available for each
hypervisor are detailed in the **gnt-instance**(8) man page, in the
**add** command, plus the following parameters which are only
configurable globally (at cluster level):

migration\_port
    Valid for the Xen PVM and KVM hypervisors.

    This option specifies the TCP port to use for live migration. For
    Xen, the same port should be configured on all nodes in the
    ``/etc/xen/xend-config.sxp`` file, under the key
    "xend-relocation-port".

migration\_bandwidth
    Valid for the KVM hypervisor.

    This option specifies the maximum bandwidth that KVM will use for
    instance live migrations. The value is in MiB/s.

    This option is only effective with kvm versions >= 78 and qemu-kvm
    versions >= 0.10.0.
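
For example, a cluster-wide KVM default could be set at initialization
time like this (the port number and cluster name are illustrative)::

    # gnt-cluster init -H kvm:migration_port=8102 cluster.example.com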

The ``-B (--backend-parameters)`` option allows you to set the default
backend parameters for the cluster. The parameter format is a
comma-separated list of key=value pairs with the following supported
keys:

vcpus
    Number of VCPUs to set for an instance by default, must be an
    integer, will be set to 1 if not specified.

memory
    Amount of memory to allocate for an instance by default, can be
    either an integer or an integer followed by a unit (M for mebibytes
    and G for gibibytes are supported), will be set to 128M if not
    specified.

auto\_balance
    Value of the auto\_balance flag for instances to use by default,
    will be set to true if not specified.
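
For example, to have new instances default to one VCPU and 512M of
memory unless overridden (the cluster name is illustrative)::

    # gnt-cluster init -B vcpus=1,memory=512M cluster.example.com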

The ``-N (--nic-parameters)`` option allows you to set the default NIC
parameters for the cluster. The parameter format is a comma-separated
list of key=value pairs with the following supported keys:

mode
    The default NIC mode, 'routed' or 'bridged'.

link
    In bridged mode the default NIC bridge. In routed mode it
    represents a hypervisor-vif-script dependent value to allow
    different instance groups. For example, under the KVM default
    network script it is interpreted as a routing table number or
    name.
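
For example, to default to bridged networking over a bridge named br0
(the bridge and cluster names are illustrative)::

    # gnt-cluster init -N mode=bridged,link=br0 cluster.example.com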

The option ``--maintain-node-health`` allows one to enable/disable
automatic maintenance actions on nodes. Currently these include
automatic shutdown of instances and deactivation of DRBD devices on
offline nodes; in the future it might be extended to automatic
removal of unknown LVM volumes, etc.

The ``--uid-pool`` option initializes the user-id pool. The
*user-id pool definition* can contain a list of user-ids and/or a
list of user-id ranges. The parameter format is a comma-separated
list of numeric user-ids or user-id ranges. The ranges are defined
by a lower and a higher boundary, separated by a dash. The boundaries
are inclusive. If the ``--uid-pool`` option is not supplied, the
user-id pool is initialized to an empty list. An empty list means
that the user-id pool feature is disabled.
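
For example, to reserve user-id 5000 plus the inclusive range 6000 to
6020 for the pool (the values and cluster name are illustrative)::

    # gnt-cluster init --uid-pool 5000,6000-6020 cluster.example.com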

The ``-I (--default-iallocator)`` option specifies the default
instance allocator. The instance allocator will be used for operations
like instance creation, instance and node migration, etc. when no
manual override is specified. If this option is not specified and
htools was not enabled at build time, the default instance allocator
will be blank, which means that relevant operations will require the
administrator to manually specify either an instance allocator, or a
set of nodes. If the option is not specified but htools was enabled,
the default iallocator will be **hail**(1) (assuming it can be found
on disk). The default iallocator can be changed later using the
**modify** command.

The ``--primary-ip-version`` option specifies the IP version used
for the primary address. Possible values are 4 and 6 for IPv4 and
IPv6, respectively. This option is used when resolving node names
and the cluster name.

The ``--node-parameters`` option allows you to set default node
parameters for the cluster. Please see **ganeti**(7) for more
information about supported key=value pairs.

The ``-C (--candidate-pool-size)`` option specifies the
``candidate_pool_size`` cluster parameter. This is the number of nodes
that the master will try to keep as master\_candidates. For more
details about this role and other node roles, see **ganeti**(7).

LIST-TAGS
~~~~~~~~~

**list-tags**

List the tags of the cluster.

MASTER-FAILOVER
~~~~~~~~~~~~~~~

**master-failover** [--no-voting]

Failover the master role to the current node.

The ``--no-voting`` option skips the remote node agreement checks.
This is dangerous, but necessary in some cases (for example failing
over the master role in a 2-node cluster with the original master
down). If the original master then comes up, it won't be able to
start its master daemon because it won't have enough votes, but
neither will the new master, if the master daemon ever needs a restart.
You can pass ``--no-voting`` to **ganeti-masterd** on the new
master to solve this problem, and run **gnt-cluster redist-conf**
to make sure the cluster is consistent again.

MASTER-PING
~~~~~~~~~~~

**master-ping**

Checks if the master daemon is alive.

If the master daemon is alive and can respond to a basic query (the
equivalent of **gnt-cluster info**), then the exit code of the
command will be 0. If the master daemon is not alive (either due to
a crash or because this is not the master node), the exit code will
be 1.
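
This makes the command handy in shell scripts, for example (a minimal
sketch)::

    # gnt-cluster master-ping && echo "master daemon is alive"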

MODIFY
~~~~~~

| **modify**
| [--vg-name *vg-name*]
| [--no-lvm-storage]
| [--enabled-hypervisors *hypervisors*]
| [{-H|--hypervisor-parameters} *hypervisor*:*hv-param*=*value*[,*hv-param*=*value*...]]
| [{-B|--backend-parameters} *be-param*=*value* [,*be-param*=*value*...]]
| [{-N|--nic-parameters} *nic-param*=*value* [,*nic-param*=*value*...]]
| [--uid-pool *user-id pool definition*]
| [--add-uids *user-id pool definition*]
| [--remove-uids *user-id pool definition*]
| [{-C|--candidate-pool-size} *candidate\_pool\_size*]
| [--maintain-node-health {yes \| no}]
| [--prealloc-wipe-disks {yes \| no}]
| [{-I|--default-iallocator} *default instance allocator*]
| [--reserved-lvs=*NAMES*]
| [--node-parameters *ndparams*]
| [--master-netdev *interface-name*]

Modify the options for the cluster.

The ``--vg-name``, ``--no-lvm-storage``, ``--enabled-hypervisors``,
``-H (--hypervisor-parameters)``, ``-B (--backend-parameters)``,
``--nic-parameters``, ``-C (--candidate-pool-size)``,
``--maintain-node-health``, ``--prealloc-wipe-disks``, ``--uid-pool``,
``--node-parameters`` and ``--master-netdev`` options are described in
the **init** command.

The ``--add-uids`` and ``--remove-uids`` options can be used to
modify the user-id pool by adding/removing a list of user-ids or
user-id ranges.

The option ``--reserved-lvs`` specifies a list (comma-separated) of
logical volume names (regular expressions) that will be
ignored by the cluster verify operation. This is useful if the
volume group used for Ganeti is shared with the system for other
uses. Note that it's not recommended to create and mark as ignored
logical volume names which match Ganeti's own name format (starting
with a UUID and then .diskN), as this option only skips the
verification, but not the actual use of the names given.

To remove all reserved logical volumes, pass in an empty argument
to the option, as in ``--reserved-lvs=`` or ``--reserved-lvs ''``.

The ``-I (--default-iallocator)`` option is described in the **init**
command. To clear the default iallocator, just pass an empty string
('').
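
For example, a possible invocation to ignore manually created backup
volumes during verification and to enlarge the candidate pool (the
pattern and pool size are illustrative) would be::

    # gnt-cluster modify --reserved-lvs='.*backup.*' -C 10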

QUEUE
~~~~~

**queue** {drain | undrain | info}

Change job queue properties.

The ``drain`` option sets the drain flag on the job queue. No new
jobs will be accepted, but jobs already in the queue will be
processed.

The ``undrain`` option will unset the drain flag on the job queue. New
jobs will be accepted.

The ``info`` option shows the properties of the job queue.

WATCHER
~~~~~~~

**watcher** {pause *duration* | continue | info}

Make the watcher pause or let it continue.

The ``pause`` option causes the watcher to pause for *duration*
seconds.

The ``continue`` option will let the watcher continue.

The ``info`` option shows whether the watcher is currently paused.
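
For example, to keep the watcher from interfering during one hour of
maintenance and to let it resume afterwards::

    # gnt-cluster watcher pause 3600
    # gnt-cluster watcher continue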

REDIST-CONF
~~~~~~~~~~~

**redist-conf** [--submit]

This command forces a full push of configuration files from the
master node to the other nodes in the cluster. This is normally not
needed, but can be run if the **verify** command complains about
configuration mismatches.

The ``--submit`` option is used to send the job to the master
daemon but not wait for its completion. The job ID will be shown so
that it can be examined via **gnt-job info**.

REMOVE-TAGS
~~~~~~~~~~~

**remove-tags** [--from *file*] {*tag*...}

Remove tags from the cluster. If any of the tags do not exist
on the cluster, the entire operation will abort.

If the ``--from`` option is given, the list of tags to be removed will
be extended with the contents of that file (each line becomes a tag).
In this case, there is no need to pass tags on the command line (if
you do, tags from both sources will be removed). A file name of - will
be interpreted as stdin.

RENAME
~~~~~~

**rename** [-f] {*name*}

Renames the cluster and in the process updates the master IP
address to the one the new name resolves to. At least one of either
the name or the IP address must be different, otherwise the
operation will be aborted.

Note that since this command can be dangerous (especially when run
over SSH), the command will require confirmation unless run with
the ``-f`` option.

RENEW-CRYPTO
~~~~~~~~~~~~

| **renew-crypto** [-f]
| [--new-cluster-certificate] [--new-confd-hmac-key]
| [--new-rapi-certificate] [--rapi-certificate *rapi-cert*]
| [--new-cluster-domain-secret] [--cluster-domain-secret *filename*]

This command will stop all Ganeti daemons in the cluster and start
them again once the new certificates and keys are replicated. The
options ``--new-cluster-certificate`` and ``--new-confd-hmac-key``
can be used to regenerate the cluster-internal SSL certificate
and the HMAC key used by **ganeti-confd**(8), respectively.

To generate a new self-signed RAPI certificate (used by
**ganeti-rapi**(8)) specify ``--new-rapi-certificate``. If you want to
use your own certificate, e.g. one signed by a certificate
authority (CA), pass its filename to ``--rapi-certificate``.

``--new-cluster-domain-secret`` generates a new, random cluster
domain secret. ``--cluster-domain-secret`` reads the secret from a
file. The cluster domain secret is used to sign information
exchanged between separate clusters via a third party.
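
For example, to replace the RAPI certificate with one signed by your
own CA (the file path is illustrative)::

    # gnt-cluster renew-crypto --rapi-certificate /etc/ganeti/rapi-signed.pem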

REPAIR-DISK-SIZES
~~~~~~~~~~~~~~~~~

**repair-disk-sizes** [instance...]

This command checks that the recorded size of the given instance's
disks matches the actual size and updates any mismatches found.
This is needed if the Ganeti configuration is no longer consistent
with reality, as it will impact some disk operations. If no
arguments are given, all instances will be checked.

Note that only active disks can be checked by this command; in case
a disk cannot be activated it's advised to use
**gnt-instance activate-disks --ignore-size ...** to force
activation without regard to the current size.

When all disk sizes are consistent, the command will return no
output. Otherwise it will log details about the inconsistencies in
the configuration.
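
For example, to check and repair the recorded disk sizes of a single
instance (the instance name is illustrative)::

    # gnt-cluster repair-disk-sizes instance1.example.com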

SEARCH-TAGS
~~~~~~~~~~~

**search-tags** {*pattern*}

Searches the tags on all objects in the cluster (the cluster
itself, the nodes and the instances) for a given pattern. The
pattern is interpreted as a regular expression and a search will be
done on it (i.e. the given pattern is not anchored to the beginning
of the string; if you want that, prefix the pattern with ^).

If no tags match the pattern, the exit code of the command
will be one. If there is at least one match, the exit code will be
zero. Each match is listed on one line, the object and the tag
separated by a space. The cluster will be listed as /cluster, a
node will be listed as /nodes/*name*, and an instance as
/instances/*name*. Example:

::

    # gnt-cluster search-tags time
    /cluster ctime:2007-09-01
    /nodes/node1.example.com mtime:2007-10-04

VERIFY
~~~~~~

**verify** [--no-nplus1-mem] [--node-group *nodegroup*]

Verify correctness of cluster configuration. This is safe with
respect to running instances, and incurs no downtime of the
instances.

If the ``--no-nplus1-mem`` option is given, Ganeti won't check
whether, if it loses a node, it can restart all the instances on
their secondaries (and report an error otherwise).

With ``--node-group``, restrict the verification to those nodes and
instances that live in the named group. This will not verify global
settings, but will allow verification of a group to be performed while
other operations are ongoing in other groups.
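
For example, to verify only the nodes and instances of the initial node
group (named "default" unless it was renamed)::

    # gnt-cluster verify --node-group default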

VERIFY-DISKS
~~~~~~~~~~~~

**verify-disks**

The command checks which instances have degraded DRBD disks and
activates the disks of those instances.

This command is run from the **ganeti-watcher** tool, which also
has a different, complementary algorithm for doing this check.
Together, these two should ensure that DRBD disks are kept
consistent.

VERSION
~~~~~~~

**version**

Show the cluster version.

.. vim: set textwidth=72 :
.. Local Variables:
.. mode: rst
.. fill-column: 72
.. End: