root / man / hbal.rst @ 26f7b098
History | View | Annotate | Download (26.6 kB)
1 | 49148d15 | Iustin Pop | HBAL(1) htools | Ganeti H-tools |
---|---|---|---|
2 | 49148d15 | Iustin Pop | =============================== |
3 | 49148d15 | Iustin Pop | |
4 | 49148d15 | Iustin Pop | NAME |
5 | 49148d15 | Iustin Pop | ---- |
6 | 49148d15 | Iustin Pop | |
7 | 49148d15 | Iustin Pop | hbal \- Cluster balancer for Ganeti |
8 | 49148d15 | Iustin Pop | |
9 | 49148d15 | Iustin Pop | SYNOPSIS |
10 | 49148d15 | Iustin Pop | -------- |
11 | 49148d15 | Iustin Pop | |
12 | 49148d15 | Iustin Pop | **hbal** {backend options...} [algorithm options...] [reporting options...] |
13 | 49148d15 | Iustin Pop | |
14 | 49148d15 | Iustin Pop | **hbal** --version |
15 | 49148d15 | Iustin Pop | |
16 | 49148d15 | Iustin Pop | |
17 | 49148d15 | Iustin Pop | Backend options: |
18 | 49148d15 | Iustin Pop | |
19 | 49148d15 | Iustin Pop | { **-m** *cluster* | **-L[** *path* **] [-X]** | **-t** *data-file* } |
20 | 49148d15 | Iustin Pop | |
21 | 49148d15 | Iustin Pop | Algorithm options: |
22 | 49148d15 | Iustin Pop | |
23 | 49148d15 | Iustin Pop | **[ --max-cpu *cpu-ratio* ]** |
24 | 49148d15 | Iustin Pop | **[ --min-disk *disk-ratio* ]** |
25 | 49148d15 | Iustin Pop | **[ -l *limit* ]** |
26 | 49148d15 | Iustin Pop | **[ -e *score* ]** |
27 | 49148d15 | Iustin Pop | **[ -g *delta* ]** **[ --min-gain-limit *threshold* ]** |
28 | 49148d15 | Iustin Pop | **[ -O *name...* ]** |
29 | 49148d15 | Iustin Pop | **[ --no-disk-moves ]** |
30 | 49148d15 | Iustin Pop | **[ -U *util-file* ]** |
31 | 49148d15 | Iustin Pop | **[ --evac-mode ]** |
32 | 49148d15 | Iustin Pop | **[ --exclude-instances *inst...* ]** |
33 | 49148d15 | Iustin Pop | |
34 | 49148d15 | Iustin Pop | Reporting options: |
35 | 49148d15 | Iustin Pop | |
36 | 49148d15 | Iustin Pop | **[ -C[ *file* ] ]** |
37 | 49148d15 | Iustin Pop | **[ -p[ *fields* ] ]** |
38 | 49148d15 | Iustin Pop | **[ --print-instances ]** |
39 | 49148d15 | Iustin Pop | **[ -o ]** |
40 | 49148d15 | Iustin Pop | **[ -v... | -q ]** |
41 | 49148d15 | Iustin Pop | |
42 | 49148d15 | Iustin Pop | |
43 | 49148d15 | Iustin Pop | DESCRIPTION |
44 | 49148d15 | Iustin Pop | ----------- |
45 | 49148d15 | Iustin Pop | |
46 | 49148d15 | Iustin Pop | hbal is a cluster balancer that looks at the current state of the |
47 | 49148d15 | Iustin Pop | cluster (nodes with their total and free disk, memory, etc.) and |
48 | 49148d15 | Iustin Pop | instance placement and computes a series of steps designed to bring |
49 | 49148d15 | Iustin Pop | the cluster into a better state. |
50 | 49148d15 | Iustin Pop | |
51 | 49148d15 | Iustin Pop | The algorithm used is designed to be stable (i.e. it will give you the |
52 | 49148d15 | Iustin Pop | same results when restarting it from the middle of the solution) and |
53 | 49148d15 | Iustin Pop | reasonably fast. It is not, however, designed to be a perfect |
54 | 49148d15 | Iustin Pop | algorithm--it is possible to make it go into a corner from which |
55 | 49148d15 | Iustin Pop | it can find no improvement, because it looks only one "step" ahead. |
56 | 49148d15 | Iustin Pop | |
57 | 49148d15 | Iustin Pop | By default, the program will show the solution incrementally as it is |
58 | 49148d15 | Iustin Pop | computed, in a somewhat cryptic format; for getting the actual Ganeti |
59 | 49148d15 | Iustin Pop | command list, use the **-C** option. |
60 | 49148d15 | Iustin Pop | |
61 | 49148d15 | Iustin Pop | ALGORITHM |
62 | 49148d15 | Iustin Pop | ~~~~~~~~~ |
63 | 49148d15 | Iustin Pop | |
64 | 49148d15 | Iustin Pop | The program works in independent steps; at each step, we compute the |
65 | 49148d15 | Iustin Pop | best instance move that lowers the cluster score. |
66 | 49148d15 | Iustin Pop | |
67 | 49148d15 | Iustin Pop | The possible move type for an instance are combinations of |
68 | 49148d15 | Iustin Pop | failover/migrate and replace-disks such that we change one of the |
69 | 49148d15 | Iustin Pop | instance nodes, and the other one remains (but possibly with changed |
70 | 49148d15 | Iustin Pop | role, e.g. from primary it becomes secondary). The list is: |
71 | 49148d15 | Iustin Pop | |
72 | 49148d15 | Iustin Pop | - failover (f) |
73 | 49148d15 | Iustin Pop | - replace secondary (r) |
74 | 49148d15 | Iustin Pop | - replace primary, a composite move (f, r, f) |
75 | 49148d15 | Iustin Pop | - failover and replace secondary, also composite (f, r) |
76 | 49148d15 | Iustin Pop | - replace secondary and failover, also composite (r, f) |
77 | 49148d15 | Iustin Pop | |
78 | 49148d15 | Iustin Pop | We don't do the only remaining possibility of replacing both nodes |
79 | 49148d15 | Iustin Pop | (r,f,r,f or the equivalent f,r,f,r) since these move needs an |
80 | 49148d15 | Iustin Pop | exhaustive search over both candidate primary and secondary nodes, and |
81 | 49148d15 | Iustin Pop | is O(n*n) in the number of nodes. Furthermore, it doesn't seems to |
82 | 49148d15 | Iustin Pop | give better scores but will result in more disk replacements. |
83 | 49148d15 | Iustin Pop | |
84 | 49148d15 | Iustin Pop | PLACEMENT RESTRICTIONS |
85 | 49148d15 | Iustin Pop | ~~~~~~~~~~~~~~~~~~~~~~ |
86 | 49148d15 | Iustin Pop | |
87 | 49148d15 | Iustin Pop | At each step, we prevent an instance move if it would cause: |
88 | 49148d15 | Iustin Pop | |
89 | 49148d15 | Iustin Pop | - a node to go into N+1 failure state |
90 | 49148d15 | Iustin Pop | - an instance to move onto an offline node (offline nodes are either |
91 | 49148d15 | Iustin Pop | read from the cluster or declared with *-O*) |
92 | 49148d15 | Iustin Pop | - an exclusion-tag based conflict (exclusion tags are read from the |
93 | 49148d15 | Iustin Pop | cluster and/or defined via the *--exclusion-tags* option) |
94 | 49148d15 | Iustin Pop | - a max vcpu/pcpu ratio to be exceeded (configured via *--max-cpu*) |
95 | 49148d15 | Iustin Pop | - min disk free percentage to go below the configured limit |
96 | 49148d15 | Iustin Pop | (configured via *--min-disk*) |
97 | 49148d15 | Iustin Pop | |
98 | 49148d15 | Iustin Pop | CLUSTER SCORING |
99 | 49148d15 | Iustin Pop | ~~~~~~~~~~~~~~~ |
100 | 49148d15 | Iustin Pop | |
101 | 49148d15 | Iustin Pop | As said before, the algorithm tries to minimise the cluster score at |
102 | 49148d15 | Iustin Pop | each step. Currently this score is computed as a sum of the following |
103 | 49148d15 | Iustin Pop | components: |
104 | 49148d15 | Iustin Pop | |
105 | 49148d15 | Iustin Pop | - standard deviation of the percent of free memory |
106 | 49148d15 | Iustin Pop | - standard deviation of the percent of reserved memory |
107 | 49148d15 | Iustin Pop | - standard deviation of the percent of free disk |
108 | 49148d15 | Iustin Pop | - count of nodes failing N+1 check |
109 | 49148d15 | Iustin Pop | - count of instances living (either as primary or secondary) on |
110 | 49148d15 | Iustin Pop | offline nodes |
111 | 49148d15 | Iustin Pop | - count of instances living (as primary) on offline nodes; this |
112 | 49148d15 | Iustin Pop | differs from the above metric by helping failover of such instances |
113 | 49148d15 | Iustin Pop | in 2-node clusters |
114 | 49148d15 | Iustin Pop | - standard deviation of the ratio of virtual-to-physical cpus (for |
115 | 49148d15 | Iustin Pop | primary instances of the node) |
116 | 49148d15 | Iustin Pop | - standard deviation of the dynamic load on the nodes, for cpus, |
117 | 49148d15 | Iustin Pop | memory, disk and network |
118 | 49148d15 | Iustin Pop | |
119 | 49148d15 | Iustin Pop | The free memory and free disk values help ensure that all nodes are |
120 | 49148d15 | Iustin Pop | somewhat balanced in their resource usage. The reserved memory helps |
121 | 49148d15 | Iustin Pop | to ensure that nodes are somewhat balanced in holding secondary |
122 | 49148d15 | Iustin Pop | instances, and that no node keeps too much memory reserved for |
123 | 49148d15 | Iustin Pop | N+1. And finally, the N+1 percentage helps guide the algorithm towards |
124 | 49148d15 | Iustin Pop | eliminating N+1 failures, if possible. |
125 | 49148d15 | Iustin Pop | |
126 | 49148d15 | Iustin Pop | Except for the N+1 failures and offline instances counts, we use the |
127 | 49148d15 | Iustin Pop | standard deviation since when used with values within a fixed range |
128 | 49148d15 | Iustin Pop | (we use percents expressed as values between zero and one) it gives |
129 | 49148d15 | Iustin Pop | consistent results across all metrics (there are some small issues |
130 | 49148d15 | Iustin Pop | related to different means, but it works generally well). The 'count' |
131 | 49148d15 | Iustin Pop | type values will have higher score and thus will matter more for |
132 | 49148d15 | Iustin Pop | balancing; thus these are better for hard constraints (like evacuating |
133 | 49148d15 | Iustin Pop | nodes and fixing N+1 failures). For example, the offline instances |
134 | 49148d15 | Iustin Pop | count (i.e. the number of instances living on offline nodes) will |
135 | 49148d15 | Iustin Pop | cause the algorithm to actively move instances away from offline |
136 | 49148d15 | Iustin Pop | nodes. This, coupled with the restriction on placement given by |
137 | 49148d15 | Iustin Pop | offline nodes, will cause evacuation of such nodes. |
138 | 49148d15 | Iustin Pop | |
139 | 49148d15 | Iustin Pop | The dynamic load values need to be read from an external file (Ganeti |
140 | 49148d15 | Iustin Pop | doesn't supply them), and are computed for each node as: sum of |
141 | 49148d15 | Iustin Pop | primary instance cpu load, sum of primary instance memory load, sum of |
142 | 49148d15 | Iustin Pop | primary and secondary instance disk load (as DRBD generates write load |
143 | 49148d15 | Iustin Pop | on secondary nodes too in normal case and in degraded scenarios also |
144 | 49148d15 | Iustin Pop | read load), and sum of primary instance network load. An example of |
145 | 49148d15 | Iustin Pop | how to generate these values for input to hbal would be to track ``xm |
146 | 49148d15 | Iustin Pop | list`` for instances over a day and by computing the delta of the cpu |
147 | 49148d15 | Iustin Pop | values, and feed that via the *-U* option for all instances (and keep |
148 | 49148d15 | Iustin Pop | the other metrics as one). For the algorithm to work, all that is |
149 | 49148d15 | Iustin Pop | needed is that the values are consistent for a metric across all |
150 | 49148d15 | Iustin Pop | instances (e.g. all instances use cpu% to report cpu usage, and not |
151 | 49148d15 | Iustin Pop | something related to number of CPU seconds used if the CPUs are |
152 | 49148d15 | Iustin Pop | different), and that they are normalised to between zero and one. Note |
153 | 49148d15 | Iustin Pop | that it's recommended to not have zero as the load value for any |
154 | 49148d15 | Iustin Pop | instance metric since then secondary instances are not well balanced. |
155 | 49148d15 | Iustin Pop | |
156 | 49148d15 | Iustin Pop | On a perfectly balanced cluster (all nodes the same size, all |
157 | 49148d15 | Iustin Pop | instances the same size and spread across the nodes equally), the |
158 | 49148d15 | Iustin Pop | values for all metrics would be zero. This doesn't happen too often in |
159 | 49148d15 | Iustin Pop | practice :) |
160 | 49148d15 | Iustin Pop | |
161 | 49148d15 | Iustin Pop | OFFLINE INSTANCES |
162 | 49148d15 | Iustin Pop | ~~~~~~~~~~~~~~~~~ |
163 | 49148d15 | Iustin Pop | |
164 | 49148d15 | Iustin Pop | Since current Ganeti versions do not report the memory used by offline |
165 | 49148d15 | Iustin Pop | (down) instances, ignoring the run status of instances will cause |
166 | 49148d15 | Iustin Pop | wrong calculations. For this reason, the algorithm subtracts the |
167 | 49148d15 | Iustin Pop | memory size of down instances from the free node memory of their |
168 | 49148d15 | Iustin Pop | primary node, in effect simulating the startup of such instances. |
169 | 49148d15 | Iustin Pop | |
170 | 49148d15 | Iustin Pop | EXCLUSION TAGS |
171 | 49148d15 | Iustin Pop | ~~~~~~~~~~~~~~ |
172 | 49148d15 | Iustin Pop | |
173 | 49148d15 | Iustin Pop | The exclusion tags mechanism is designed to prevent instances which |
174 | 49148d15 | Iustin Pop | run the same workload (e.g. two DNS servers) to land on the same node, |
175 | 49148d15 | Iustin Pop | which would make the respective node a SPOF for the given service. |
176 | 49148d15 | Iustin Pop | |
177 | 49148d15 | Iustin Pop | It works by tagging instances with certain tags and then building |
178 | 49148d15 | Iustin Pop | exclusion maps based on these. Which tags are actually used is |
179 | 49148d15 | Iustin Pop | configured either via the command line (option *--exclusion-tags*) |
180 | 49148d15 | Iustin Pop | or via adding them to the cluster tags: |
181 | 49148d15 | Iustin Pop | |
182 | 49148d15 | Iustin Pop | --exclusion-tags=a,b |
183 | 49148d15 | Iustin Pop | This will make all instance tags of the form *a:\**, *b:\** be |
184 | 49148d15 | Iustin Pop | considered for the exclusion map |
185 | 49148d15 | Iustin Pop | |
186 | 49148d15 | Iustin Pop | cluster tags *htools:iextags:a*, *htools:iextags:b* |
187 | 49148d15 | Iustin Pop | This will make instance tags *a:\**, *b:\** be considered for the |
188 | 49148d15 | Iustin Pop | exclusion map. More precisely, the suffix of cluster tags starting |
189 | 49148d15 | Iustin Pop | with *htools:iextags:* will become the prefix of the exclusion tags. |
190 | 49148d15 | Iustin Pop | |
191 | 49148d15 | Iustin Pop | Both the above forms mean that two instances both having (e.g.) the |
192 | 49148d15 | Iustin Pop | tag *a:foo* or *b:bar* won't end on the same node. |
193 | 49148d15 | Iustin Pop | |
194 | 49148d15 | Iustin Pop | OPTIONS |
195 | 49148d15 | Iustin Pop | ------- |
196 | 49148d15 | Iustin Pop | |
197 | 49148d15 | Iustin Pop | The options that can be passed to the program are as follows: |
198 | 49148d15 | Iustin Pop | |
199 | 49148d15 | Iustin Pop | -C, --print-commands |
200 | 49148d15 | Iustin Pop | Print the command list at the end of the run. Without this, the |
201 | 49148d15 | Iustin Pop | program will only show a shorter, but cryptic output. |
202 | 49148d15 | Iustin Pop | |
203 | 49148d15 | Iustin Pop | Note that the moves list will be split into independent steps, |
204 | 49148d15 | Iustin Pop | called "jobsets", but only for visual inspection, not for actually |
205 | 49148d15 | Iustin Pop | parallelisation. It is not possible to parallelise these directly |
206 | 49148d15 | Iustin Pop | when executed via "gnt-instance" commands, since a compound command |
207 | 49148d15 | Iustin Pop | (e.g. failover and replace-disks) must be executed |
208 | 49148d15 | Iustin Pop | serially. Parallel execution is only possible when using the Luxi |
209 | 49148d15 | Iustin Pop | backend and the *-L* option. |
210 | 49148d15 | Iustin Pop | |
211 | 49148d15 | Iustin Pop | The algorithm for splitting the moves into jobsets is by |
212 | 49148d15 | Iustin Pop | accumulating moves until the next move is touching nodes already |
213 | 49148d15 | Iustin Pop | touched by the current moves; this means we can't execute in |
214 | 49148d15 | Iustin Pop | parallel (due to resource allocation in Ganeti) and thus we start a |
215 | 49148d15 | Iustin Pop | new jobset. |
216 | 49148d15 | Iustin Pop | |
217 | 49148d15 | Iustin Pop | -p, --print-nodes |
218 | 49148d15 | Iustin Pop | Prints the before and after node status, in a format designed to |
219 | 49148d15 | Iustin Pop | allow the user to understand the node's most important parameters. |
220 | 49148d15 | Iustin Pop | |
221 | 49148d15 | Iustin Pop | It is possible to customise the listed information by passing a |
222 | 49148d15 | Iustin Pop | comma-separated list of field names to this option (the field list |
223 | 49148d15 | Iustin Pop | is currently undocumented), or to extend the default field list by |
224 | 49148d15 | Iustin Pop | prefixing the additional field list with a plus sign. By default, |
225 | 49148d15 | Iustin Pop | the node list will contain the following information: |
226 | 49148d15 | Iustin Pop | |
227 | 49148d15 | Iustin Pop | F |
228 | 49148d15 | Iustin Pop | a character denoting the status of the node, with '-' meaning an |
229 | 49148d15 | Iustin Pop | offline node, '*' meaning N+1 failure and blank meaning a good |
230 | 49148d15 | Iustin Pop | node |
231 | 49148d15 | Iustin Pop | |
232 | 49148d15 | Iustin Pop | Name |
233 | 49148d15 | Iustin Pop | the node name |
234 | 49148d15 | Iustin Pop | |
235 | 49148d15 | Iustin Pop | t_mem |
236 | 49148d15 | Iustin Pop | the total node memory |
237 | 49148d15 | Iustin Pop | |
238 | 49148d15 | Iustin Pop | n_mem |
239 | 49148d15 | Iustin Pop | the memory used by the node itself |
240 | 49148d15 | Iustin Pop | |
241 | 49148d15 | Iustin Pop | i_mem |
242 | 49148d15 | Iustin Pop | the memory used by instances |
243 | 49148d15 | Iustin Pop | |
244 | 49148d15 | Iustin Pop | x_mem |
245 | 49148d15 | Iustin Pop | amount memory which seems to be in use but cannot be determined |
246 | 49148d15 | Iustin Pop | why or by which instance; usually this means that the hypervisor |
247 | 49148d15 | Iustin Pop | has some overhead or that there are other reporting errors |
248 | 49148d15 | Iustin Pop | |
249 | 49148d15 | Iustin Pop | f_mem |
250 | 49148d15 | Iustin Pop | the free node memory |
251 | 49148d15 | Iustin Pop | |
252 | 49148d15 | Iustin Pop | r_mem |
253 | 49148d15 | Iustin Pop | the reserved node memory, which is the amount of free memory |
254 | 49148d15 | Iustin Pop | needed for N+1 compliance |
255 | 49148d15 | Iustin Pop | |
256 | 49148d15 | Iustin Pop | t_dsk |
257 | 49148d15 | Iustin Pop | total disk |
258 | 49148d15 | Iustin Pop | |
259 | 49148d15 | Iustin Pop | f_dsk |
260 | 49148d15 | Iustin Pop | free disk |
261 | 49148d15 | Iustin Pop | |
262 | 49148d15 | Iustin Pop | pcpu |
263 | 49148d15 | Iustin Pop | the number of physical cpus on the node |
264 | 49148d15 | Iustin Pop | |
265 | 49148d15 | Iustin Pop | vcpu |
266 | 49148d15 | Iustin Pop | the number of virtual cpus allocated to primary instances |
267 | 49148d15 | Iustin Pop | |
268 | 49148d15 | Iustin Pop | pcnt |
269 | 49148d15 | Iustin Pop | number of primary instances |
270 | 49148d15 | Iustin Pop | |
271 | 49148d15 | Iustin Pop | scnt |
272 | 49148d15 | Iustin Pop | number of secondary instances |
273 | 49148d15 | Iustin Pop | |
274 | 49148d15 | Iustin Pop | p_fmem |
275 | 49148d15 | Iustin Pop | percent of free memory |
276 | 49148d15 | Iustin Pop | |
277 | 49148d15 | Iustin Pop | p_fdsk |
278 | 49148d15 | Iustin Pop | percent of free disk |
279 | 49148d15 | Iustin Pop | |
280 | 49148d15 | Iustin Pop | r_cpu |
281 | 49148d15 | Iustin Pop | ratio of virtual to physical cpus |
282 | 49148d15 | Iustin Pop | |
283 | 49148d15 | Iustin Pop | lCpu |
284 | 49148d15 | Iustin Pop | the dynamic CPU load (if the information is available) |
285 | 49148d15 | Iustin Pop | |
286 | 49148d15 | Iustin Pop | lMem |
287 | 49148d15 | Iustin Pop | the dynamic memory load (if the information is available) |
288 | 49148d15 | Iustin Pop | |
289 | 49148d15 | Iustin Pop | lDsk |
290 | 49148d15 | Iustin Pop | the dynamic disk load (if the information is available) |
291 | 49148d15 | Iustin Pop | |
292 | 49148d15 | Iustin Pop | lNet |
293 | 49148d15 | Iustin Pop | the dynamic net load (if the information is available) |
294 | 49148d15 | Iustin Pop | |
295 | 49148d15 | Iustin Pop | --print-instances |
296 | 49148d15 | Iustin Pop | Prints the before and after instance map. This is less useful as the |
297 | 49148d15 | Iustin Pop | node status, but it can help in understanding instance moves. |
298 | 49148d15 | Iustin Pop | |
299 | 49148d15 | Iustin Pop | -o, --oneline |
300 | 49148d15 | Iustin Pop | Only shows a one-line output from the program, designed for the case |
301 | 49148d15 | Iustin Pop | when one wants to look at multiple clusters at once and check their |
302 | 49148d15 | Iustin Pop | status. |
303 | 49148d15 | Iustin Pop | |
304 | 49148d15 | Iustin Pop | The line will contain four fields: |
305 | 49148d15 | Iustin Pop | |
306 | 49148d15 | Iustin Pop | - initial cluster score |
307 | 49148d15 | Iustin Pop | - number of steps in the solution |
308 | 49148d15 | Iustin Pop | - final cluster score |
309 | 49148d15 | Iustin Pop | - improvement in the cluster score |
310 | 49148d15 | Iustin Pop | |
311 | 49148d15 | Iustin Pop | -O *name* |
312 | 49148d15 | Iustin Pop | This option (which can be given multiple times) will mark nodes as |
313 | 49148d15 | Iustin Pop | being *offline*. This means a couple of things: |
314 | 49148d15 | Iustin Pop | |
315 | 49148d15 | Iustin Pop | - instances won't be placed on these nodes, not even temporarily; |
316 | 49148d15 | Iustin Pop | e.g. the *replace primary* move is not available if the secondary |
317 | 49148d15 | Iustin Pop | node is offline, since this move requires a failover. |
318 | 49148d15 | Iustin Pop | - these nodes will not be included in the score calculation (except |
319 | 49148d15 | Iustin Pop | for the percentage of instances on offline nodes) |
320 | 49148d15 | Iustin Pop | |
321 | 49148d15 | Iustin Pop | Note that algorithm will also mark as offline any nodes which are |
322 | 49148d15 | Iustin Pop | reported by RAPI as such, or that have "?" in file-based input in |
323 | 49148d15 | Iustin Pop | any numeric fields. |
324 | 49148d15 | Iustin Pop | |
325 | 49148d15 | Iustin Pop | -e *score*, --min-score=*score* |
326 | 49148d15 | Iustin Pop | This parameter denotes the minimum score we are happy with and alters |
327 | 49148d15 | Iustin Pop | the computation in two ways: |
328 | 49148d15 | Iustin Pop | |
329 | 49148d15 | Iustin Pop | - if the cluster has the initial score lower than this value, then we |
330 | 49148d15 | Iustin Pop | don't enter the algorithm at all, and exit with success |
331 | 49148d15 | Iustin Pop | - during the iterative process, if we reach a score lower than this |
332 | 49148d15 | Iustin Pop | value, we exit the algorithm |
333 | 49148d15 | Iustin Pop | |
334 | 49148d15 | Iustin Pop | The default value of the parameter is currently ``1e-9`` (chosen |
335 | 49148d15 | Iustin Pop | empirically). |
336 | 49148d15 | Iustin Pop | |
337 | 49148d15 | Iustin Pop | -g *delta*, --min-gain=*delta* |
338 | 49148d15 | Iustin Pop | Since the balancing algorithm can sometimes result in just very tiny |
339 | 49148d15 | Iustin Pop | improvements, that bring less gain that they cost in relocation |
340 | 49148d15 | Iustin Pop | time, this parameter (defaulting to 0.01) represents the minimum |
341 | 49148d15 | Iustin Pop | gain we require during a step, to continue balancing. |
342 | 49148d15 | Iustin Pop | |
343 | 49148d15 | Iustin Pop | --min-gain-limit=*threshold* |
344 | 49148d15 | Iustin Pop | The above min-gain option will only take effect if the cluster score |
345 | 49148d15 | Iustin Pop | is already below *threshold* (defaults to 0.1). The rationale behind |
346 | 49148d15 | Iustin Pop | this setting is that at high cluster scores (badly balanced |
347 | 49148d15 | Iustin Pop | clusters), we don't want to abort the rebalance too quickly, as |
348 | 49148d15 | Iustin Pop | later gains might still be significant. However, under the |
349 | 49148d15 | Iustin Pop | threshold, the total gain is only the threshold value, so we can |
350 | 49148d15 | Iustin Pop | exit early. |
351 | 49148d15 | Iustin Pop | |
352 | 49148d15 | Iustin Pop | --no-disk-moves |
353 | 49148d15 | Iustin Pop | This parameter prevents hbal from using disk move |
354 | 49148d15 | Iustin Pop | (i.e. "gnt-instance replace-disks") operations. This will result in |
355 | 49148d15 | Iustin Pop | a much quicker balancing, but of course the improvements are |
356 | 49148d15 | Iustin Pop | limited. It is up to the user to decide when to use one or another. |
357 | 49148d15 | Iustin Pop | |
358 | 49148d15 | Iustin Pop | --evac-mode |
359 | 49148d15 | Iustin Pop | This parameter restricts the list of instances considered for moving |
360 | 49148d15 | Iustin Pop | to the ones living on offline/drained nodes. It can be used as a |
361 | 49148d15 | Iustin Pop | (bulk) replacement for Ganeti's own *gnt-node evacuate*, with the |
362 | 49148d15 | Iustin Pop | note that it doesn't guarantee full evacuation. |
363 | 49148d15 | Iustin Pop | |
364 | 49148d15 | Iustin Pop | --exclude-instances=*instances* |
365 | 49148d15 | Iustin Pop | This parameter marks the given instances (as a comma-separated list) |
366 | 49148d15 | Iustin Pop | from being moved during the rebalance. |
367 | 49148d15 | Iustin Pop | |
368 | 49148d15 | Iustin Pop | -U *util-file* |
369 | 49148d15 | Iustin Pop | This parameter specifies a file holding instance dynamic utilisation |
370 | 49148d15 | Iustin Pop | information that will be used to tweak the balancing algorithm to |
371 | 49148d15 | Iustin Pop | equalise load on the nodes (as opposed to static resource |
372 | 49148d15 | Iustin Pop | usage). The file is in the format "instance_name cpu_util mem_util |
373 | 49148d15 | Iustin Pop | disk_util net_util" where the "_util" parameters are interpreted as |
374 | 49148d15 | Iustin Pop | numbers and the instance name must match exactly the instance as |
375 | 49148d15 | Iustin Pop | read from Ganeti. In case of unknown instance names, the program |
376 | 49148d15 | Iustin Pop | will abort. |
377 | 49148d15 | Iustin Pop | |
378 | 49148d15 | Iustin Pop | If not given, the default values are one for all metrics and thus |
379 | 49148d15 | Iustin Pop | dynamic utilisation has only one effect on the algorithm: the |
380 | 49148d15 | Iustin Pop | equalisation of the secondary instances across nodes (this is the |
381 | 49148d15 | Iustin Pop | only metric that is not tracked by another, dedicated value, and |
382 | 49148d15 | Iustin Pop | thus the disk load of instances will cause secondary instance |
383 | 49148d15 | Iustin Pop | equalisation). Note that value of one will also influence slightly |
384 | 49148d15 | Iustin Pop | the primary instance count, but that is already tracked via other |
385 | 49148d15 | Iustin Pop | metrics and thus the influence of the dynamic utilisation will be |
386 | 49148d15 | Iustin Pop | practically insignificant. |
387 | 49148d15 | Iustin Pop | |
388 | 49148d15 | Iustin Pop | -t *datafile*, --text-data=*datafile* |
389 | 49148d15 | Iustin Pop | The name of the file holding node and instance information (if not |
390 | 49148d15 | Iustin Pop | collecting via RAPI or LUXI). This or one of the other backends must |
391 | 49148d15 | Iustin Pop | be selected. |
392 | 49148d15 | Iustin Pop | |
393 | 4188449c | Iustin Pop | -S *filename*, --save-cluster=*filename* |
394 | 4188449c | Iustin Pop | If given, the state of the cluster before the balancing is saved to |
395 | 4188449c | Iustin Pop | the given file plus the extension "original" |
396 | 4188449c | Iustin Pop | (i.e. *filename*.original), and the state at the end of the |
397 | 4188449c | Iustin Pop | balancing is saved to the given file plus the extension "balanced" |
398 | 4188449c | Iustin Pop | (i.e. *filename*.balanced). This allows re-feeding the cluster state |
399 | 4188449c | Iustin Pop | to either hbal itself or for example hspace. |
400 | 49148d15 | Iustin Pop | |
401 | 49148d15 | Iustin Pop | -m *cluster* |
402 | 49148d15 | Iustin Pop | Collect data directly from the *cluster* given as an argument via |
403 | 49148d15 | Iustin Pop | RAPI. If the argument doesn't contain a colon (:), then it is |
404 | 49148d15 | Iustin Pop | converted into a fully-built URL via prepending ``https://`` and |
405 | 49148d15 | Iustin Pop | appending the default RAPI port, otherwise it's considered a |
406 | 49148d15 | Iustin Pop | fully-specified URL and is used as-is. |
407 | 49148d15 | Iustin Pop | |
408 | 49148d15 | Iustin Pop | -L [*path*] |
409 | 49148d15 | Iustin Pop | Collect data directly from the master daemon, which is to be |
410 | 49148d15 | Iustin Pop | contacted via the luxi (an internal Ganeti protocol). An optional |
411 | 49148d15 | Iustin Pop | *path* argument is interpreted as the path to the unix socket on |
412 | 49148d15 | Iustin Pop | which the master daemon listens; otherwise, the default path used by |
413 | 49148d15 | Iustin Pop | ganeti when installed with *--localstatedir=/var* is used. |
414 | 49148d15 | Iustin Pop | |
415 | 49148d15 | Iustin Pop | -X |
416 | 49148d15 | Iustin Pop | When using the Luxi backend, hbal can also execute the given |
417 | 49148d15 | Iustin Pop | commands. The execution method is to execute the individual jobsets |
418 | 49148d15 | Iustin Pop | (see the *-C* option for details) in separate stages, aborting if at |
419 | 49148d15 | Iustin Pop | any time a jobset doesn't have all jobs successful. Each step in the |
420 | 49148d15 | Iustin Pop | balancing solution will be translated into exactly one Ganeti job |
421 | 49148d15 | Iustin Pop | (having between one and three OpCodes), and all the steps in a |
422 | 49148d15 | Iustin Pop | jobset will be executed in parallel. The jobsets themselves are |
423 | 49148d15 | Iustin Pop | executed serially. |
424 | 49148d15 | Iustin Pop | |
425 | 49148d15 | Iustin Pop | -l *N*, --max-length=*N* |
426 | 49148d15 | Iustin Pop | Restrict the solution to this length. This can be used for example |
427 | 49148d15 | Iustin Pop | to automate the execution of the balancing. |
428 | 49148d15 | Iustin Pop | |
429 | 49148d15 | Iustin Pop | --max-cpu=*cpu-ratio* |
430 | 49148d15 | Iustin Pop | The maximum virtual to physical cpu ratio, as a floating point |
431 | 49148d15 | Iustin Pop | number between zero and one. For example, specifying *cpu-ratio* as |
432 | 49148d15 | Iustin Pop | **2.5** means that, for a 4-cpu machine, a maximum of 10 virtual |
433 | 49148d15 | Iustin Pop | cpus should be allowed to be in use for primary instances. A value |
434 | 49148d15 | Iustin Pop | of one doesn't make sense though, as that means no disk space can be |
435 | 49148d15 | Iustin Pop | used on it. |
436 | 49148d15 | Iustin Pop | |
437 | 49148d15 | Iustin Pop | --min-disk=*disk-ratio* |
438 | 49148d15 | Iustin Pop | The minimum amount of free disk space remaining, as a floating point |
439 | 49148d15 | Iustin Pop | number. For example, specifying *disk-ratio* as **0.25** means that |
440 | 49148d15 | Iustin Pop | at least one quarter of disk space should be left free on nodes. |
441 | 49148d15 | Iustin Pop | |
442 | 646aa028 | Iustin Pop | -G *uuid*, --group=*uuid* |
443 | 646aa028 | Iustin Pop | On an multi-group cluster, select this group for |
444 | 646aa028 | Iustin Pop | processing. Otherwise hbal will abort, since it cannot balance |
445 | 646aa028 | Iustin Pop | multiple groups at the same time. |
446 | 646aa028 | Iustin Pop | |
447 | 49148d15 | Iustin Pop | -v, --verbose |
448 | 49148d15 | Iustin Pop | Increase the output verbosity. Each usage of this option will |
449 | 49148d15 | Iustin Pop | increase the verbosity (currently more than 2 doesn't make sense) |
450 | 49148d15 | Iustin Pop | from the default of one. |
451 | 49148d15 | Iustin Pop | |
452 | 49148d15 | Iustin Pop | -q, --quiet |
453 | 49148d15 | Iustin Pop | Decrease the output verbosity. Each usage of this option will |
454 | 49148d15 | Iustin Pop | decrease the verbosity (less than zero doesn't make sense) from the |
455 | 49148d15 | Iustin Pop | default of one. |
456 | 49148d15 | Iustin Pop | |
457 | 49148d15 | Iustin Pop | -V, --version |
458 | 49148d15 | Iustin Pop | Just show the program version and exit. |
459 | 49148d15 | Iustin Pop | |
460 | 49148d15 | Iustin Pop | EXIT STATUS |
461 | 49148d15 | Iustin Pop | ----------- |
462 | 49148d15 | Iustin Pop | |
463 | 6656790a | Iustin Pop | The exit status of the command will be zero, unless for some reason |
464 | 6656790a | Iustin Pop | the algorithm fatally failed (e.g. wrong node or instance data), or |
465 | 6656790a | Iustin Pop | (in case of job execution) any job has failed. |
466 | 49148d15 | Iustin Pop | |
467 | 49148d15 | Iustin Pop | BUGS |
468 | 49148d15 | Iustin Pop | ---- |
469 | 49148d15 | Iustin Pop | |
470 | 49148d15 | Iustin Pop | The program does not check its input data for consistency, and aborts |
471 | 49148d15 | Iustin Pop | with cryptic errors messages in this case. |
472 | 49148d15 | Iustin Pop | |
473 | 49148d15 | Iustin Pop | The algorithm is not perfect. |
474 | 49148d15 | Iustin Pop | |
475 | 49148d15 | Iustin Pop | The output format is not easily scriptable, and the program should |
476 | 49148d15 | Iustin Pop | feed moves directly into Ganeti (either via RAPI or via a gnt-debug |
477 | 49148d15 | Iustin Pop | input file). |
478 | 49148d15 | Iustin Pop | |
479 | 49148d15 | Iustin Pop | EXAMPLE |
480 | 49148d15 | Iustin Pop | ------- |
481 | 49148d15 | Iustin Pop | |
482 | 49148d15 | Iustin Pop | Note that these examples are not for the latest version (they don't |
483 | 49148d15 | Iustin Pop | have full node data). |
484 | 49148d15 | Iustin Pop | |
485 | 49148d15 | Iustin Pop | Default output |
486 | 49148d15 | Iustin Pop | ~~~~~~~~~~~~~~ |
487 | 49148d15 | Iustin Pop | |
488 | 49148d15 | Iustin Pop | With the default options, the program shows each individual step and |
489 | 49148d15 | Iustin Pop | the improvements it brings in cluster score:: |
490 | 49148d15 | Iustin Pop | |
491 | 49148d15 | Iustin Pop | $ hbal |
492 | 49148d15 | Iustin Pop | Loaded 20 nodes, 80 instances |
493 | 49148d15 | Iustin Pop | Cluster is not N+1 happy, continuing but no guarantee that the cluster will end N+1 happy. |
494 | 49148d15 | Iustin Pop | Initial score: 0.52329131 |
495 | 49148d15 | Iustin Pop | Trying to minimize the CV... |
496 | 49148d15 | Iustin Pop | 1. instance14 node1:node10 => node16:node10 0.42109120 a=f r:node16 f |
497 | 49148d15 | Iustin Pop | 2. instance54 node4:node15 => node16:node15 0.31904594 a=f r:node16 f |
498 | 49148d15 | Iustin Pop | 3. instance4 node5:node2 => node2:node16 0.26611015 a=f r:node16 |
499 | 49148d15 | Iustin Pop | 4. instance48 node18:node20 => node2:node18 0.21361717 a=r:node2 f |
500 | 49148d15 | Iustin Pop | 5. instance93 node19:node18 => node16:node19 0.16166425 a=r:node16 f |
501 | 49148d15 | Iustin Pop | 6. instance89 node3:node20 => node2:node3 0.11005629 a=r:node2 f |
502 | 49148d15 | Iustin Pop | 7. instance5 node6:node2 => node16:node6 0.05841589 a=r:node16 f |
503 | 49148d15 | Iustin Pop | 8. instance94 node7:node20 => node20:node16 0.00658759 a=f r:node16 |
504 | 49148d15 | Iustin Pop | 9. instance44 node20:node2 => node2:node15 0.00438740 a=f r:node15 |
505 | 49148d15 | Iustin Pop | 10. instance62 node14:node18 => node14:node16 0.00390087 a=r:node16 |
506 | 49148d15 | Iustin Pop | 11. instance13 node11:node14 => node11:node16 0.00361787 a=r:node16 |
507 | 49148d15 | Iustin Pop | 12. instance19 node10:node11 => node10:node7 0.00336636 a=r:node7 |
508 | 49148d15 | Iustin Pop | 13. instance43 node12:node13 => node12:node1 0.00305681 a=r:node1 |
509 | 49148d15 | Iustin Pop | 14. instance1 node1:node2 => node1:node4 0.00263124 a=r:node4 |
510 | 49148d15 | Iustin Pop | 15. instance58 node19:node20 => node19:node17 0.00252594 a=r:node17 |
511 | 49148d15 | Iustin Pop | Cluster score improved from 0.52329131 to 0.00252594 |
512 | 49148d15 | Iustin Pop | |
513 | 49148d15 | Iustin Pop | In the above output, we can see: |
514 | 49148d15 | Iustin Pop | |
515 | 49148d15 | Iustin Pop | - the input data (here from files) shows a cluster with 20 nodes and |
516 | 49148d15 | Iustin Pop | 80 instances |
517 | 49148d15 | Iustin Pop | - the cluster is not initially N+1 compliant |
518 | 49148d15 | Iustin Pop | - the initial score is 0.52329131 |
519 | 49148d15 | Iustin Pop | |
520 | 49148d15 | Iustin Pop | The step list follows, showing the instance, its initial |
521 | 49148d15 | Iustin Pop | primary/secondary nodes, the new primary secondary, the cluster list, |
522 | 49148d15 | Iustin Pop | and the actions taken in this step (with 'f' denoting failover/migrate |
523 | 49148d15 | Iustin Pop | and 'r' denoting replace secondary). |
524 | 49148d15 | Iustin Pop | |
525 | 49148d15 | Iustin Pop | Finally, the program shows the improvement in cluster score. |
526 | 49148d15 | Iustin Pop | |
527 | 49148d15 | Iustin Pop | A more detailed output is obtained via the *-C* and *-p* options:: |
528 | 49148d15 | Iustin Pop | |
529 | 49148d15 | Iustin Pop | $ hbal |
530 | 49148d15 | Iustin Pop | Loaded 20 nodes, 80 instances |
531 | 49148d15 | Iustin Pop | Cluster is not N+1 happy, continuing but no guarantee that the cluster will end N+1 happy. |
532 | 49148d15 | Iustin Pop | Initial cluster status: |
533 | 49148d15 | Iustin Pop | N1 Name t_mem f_mem r_mem t_dsk f_dsk pri sec p_fmem p_fdsk |
534 | 49148d15 | Iustin Pop | * node1 32762 1280 6000 1861 1026 5 3 0.03907 0.55179 |
535 | 49148d15 | Iustin Pop | node2 32762 31280 12000 1861 1026 0 8 0.95476 0.55179 |
536 | 49148d15 | Iustin Pop | * node3 32762 1280 6000 1861 1026 5 3 0.03907 0.55179 |
537 | 49148d15 | Iustin Pop | * node4 32762 1280 6000 1861 1026 5 3 0.03907 0.55179 |
538 | 49148d15 | Iustin Pop | * node5 32762 1280 6000 1861 978 5 5 0.03907 0.52573 |
539 | 49148d15 | Iustin Pop | * node6 32762 1280 6000 1861 1026 5 3 0.03907 0.55179 |
540 | 49148d15 | Iustin Pop | * node7 32762 1280 6000 1861 1026 5 3 0.03907 0.55179 |
541 | 49148d15 | Iustin Pop | node8 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
542 | 49148d15 | Iustin Pop | node9 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
543 | 49148d15 | Iustin Pop | * node10 32762 7280 12000 1861 1026 4 4 0.22221 0.55179 |
544 | 49148d15 | Iustin Pop | node11 32762 7280 6000 1861 922 4 5 0.22221 0.49577 |
545 | 49148d15 | Iustin Pop | node12 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
546 | 49148d15 | Iustin Pop | node13 32762 7280 6000 1861 922 4 5 0.22221 0.49577 |
547 | 49148d15 | Iustin Pop | node14 32762 7280 6000 1861 922 4 5 0.22221 0.49577 |
548 | 49148d15 | Iustin Pop | * node15 32762 7280 12000 1861 1131 4 3 0.22221 0.60782 |
549 | 49148d15 | Iustin Pop | node16 32762 31280 0 1861 1860 0 0 0.95476 1.00000 |
550 | 49148d15 | Iustin Pop | node17 32762 7280 6000 1861 1106 5 3 0.22221 0.59479 |
551 | 49148d15 | Iustin Pop | * node18 32762 1280 6000 1396 561 5 3 0.03907 0.40239 |
552 | 49148d15 | Iustin Pop | * node19 32762 1280 6000 1861 1026 5 3 0.03907 0.55179 |
553 | 49148d15 | Iustin Pop | node20 32762 13280 12000 1861 689 3 9 0.40535 0.37068 |
554 | 49148d15 | Iustin Pop | |
555 | 49148d15 | Iustin Pop | Initial score: 0.52329131 |
556 | 49148d15 | Iustin Pop | Trying to minimize the CV... |
557 | 49148d15 | Iustin Pop | 1. instance14 node1:node10 => node16:node10 0.42109120 a=f r:node16 f |
558 | 49148d15 | Iustin Pop | 2. instance54 node4:node15 => node16:node15 0.31904594 a=f r:node16 f |
559 | 49148d15 | Iustin Pop | 3. instance4 node5:node2 => node2:node16 0.26611015 a=f r:node16 |
560 | 49148d15 | Iustin Pop | 4. instance48 node18:node20 => node2:node18 0.21361717 a=r:node2 f |
561 | 49148d15 | Iustin Pop | 5. instance93 node19:node18 => node16:node19 0.16166425 a=r:node16 f |
562 | 49148d15 | Iustin Pop | 6. instance89 node3:node20 => node2:node3 0.11005629 a=r:node2 f |
563 | 49148d15 | Iustin Pop | 7. instance5 node6:node2 => node16:node6 0.05841589 a=r:node16 f |
564 | 49148d15 | Iustin Pop | 8. instance94 node7:node20 => node20:node16 0.00658759 a=f r:node16 |
565 | 49148d15 | Iustin Pop | 9. instance44 node20:node2 => node2:node15 0.00438740 a=f r:node15 |
566 | 49148d15 | Iustin Pop | 10. instance62 node14:node18 => node14:node16 0.00390087 a=r:node16 |
567 | 49148d15 | Iustin Pop | 11. instance13 node11:node14 => node11:node16 0.00361787 a=r:node16 |
568 | 49148d15 | Iustin Pop | 12. instance19 node10:node11 => node10:node7 0.00336636 a=r:node7 |
569 | 49148d15 | Iustin Pop | 13. instance43 node12:node13 => node12:node1 0.00305681 a=r:node1 |
570 | 49148d15 | Iustin Pop | 14. instance1 node1:node2 => node1:node4 0.00263124 a=r:node4 |
571 | 49148d15 | Iustin Pop | 15. instance58 node19:node20 => node19:node17 0.00252594 a=r:node17 |
572 | 49148d15 | Iustin Pop | Cluster score improved from 0.52329131 to 0.00252594 |
573 | 49148d15 | Iustin Pop | |
574 | 49148d15 | Iustin Pop | Commands to run to reach the above solution: |
575 | 49148d15 | Iustin Pop | echo step 1 |
576 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance14 |
577 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance14 |
578 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance14 |
579 | 49148d15 | Iustin Pop | echo step 2 |
580 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance54 |
581 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance54 |
582 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance54 |
583 | 49148d15 | Iustin Pop | echo step 3 |
584 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance4 |
585 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance4 |
586 | 49148d15 | Iustin Pop | echo step 4 |
587 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node2 instance48 |
588 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance48 |
589 | 49148d15 | Iustin Pop | echo step 5 |
590 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance93 |
591 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance93 |
592 | 49148d15 | Iustin Pop | echo step 6 |
593 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node2 instance89 |
594 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance89 |
595 | 49148d15 | Iustin Pop | echo step 7 |
596 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance5 |
597 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance5 |
598 | 49148d15 | Iustin Pop | echo step 8 |
599 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance94 |
600 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance94 |
601 | 49148d15 | Iustin Pop | echo step 9 |
602 | 49148d15 | Iustin Pop | echo gnt-instance migrate instance44 |
603 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node15 instance44 |
604 | 49148d15 | Iustin Pop | echo step 10 |
605 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance62 |
606 | 49148d15 | Iustin Pop | echo step 11 |
607 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node16 instance13 |
608 | 49148d15 | Iustin Pop | echo step 12 |
609 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node7 instance19 |
610 | 49148d15 | Iustin Pop | echo step 13 |
611 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node1 instance43 |
612 | 49148d15 | Iustin Pop | echo step 14 |
613 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node4 instance1 |
614 | 49148d15 | Iustin Pop | echo step 15 |
615 | 49148d15 | Iustin Pop | echo gnt-instance replace-disks -n node17 instance58 |
616 | 49148d15 | Iustin Pop | |
617 | 49148d15 | Iustin Pop | Final cluster status: |
618 | 49148d15 | Iustin Pop | N1 Name t_mem f_mem r_mem t_dsk f_dsk pri sec p_fmem p_fdsk |
619 | 49148d15 | Iustin Pop | node1 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
620 | 49148d15 | Iustin Pop | node2 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
621 | 49148d15 | Iustin Pop | node3 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
622 | 49148d15 | Iustin Pop | node4 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
623 | 49148d15 | Iustin Pop | node5 32762 7280 6000 1861 1078 4 5 0.22221 0.57947 |
624 | 49148d15 | Iustin Pop | node6 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
625 | 49148d15 | Iustin Pop | node7 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
626 | 49148d15 | Iustin Pop | node8 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
627 | 49148d15 | Iustin Pop | node9 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
628 | 49148d15 | Iustin Pop | node10 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
629 | 49148d15 | Iustin Pop | node11 32762 7280 6000 1861 1022 4 4 0.22221 0.54951 |
630 | 49148d15 | Iustin Pop | node12 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
631 | 49148d15 | Iustin Pop | node13 32762 7280 6000 1861 1022 4 4 0.22221 0.54951 |
632 | 49148d15 | Iustin Pop | node14 32762 7280 6000 1861 1022 4 4 0.22221 0.54951 |
633 | 49148d15 | Iustin Pop | node15 32762 7280 6000 1861 1031 4 4 0.22221 0.55408 |
634 | 49148d15 | Iustin Pop | node16 32762 7280 6000 1861 1060 4 4 0.22221 0.57007 |
635 | 49148d15 | Iustin Pop | node17 32762 7280 6000 1861 1006 5 4 0.22221 0.54105 |
636 | 49148d15 | Iustin Pop | node18 32762 7280 6000 1396 761 4 2 0.22221 0.54570 |
637 | 49148d15 | Iustin Pop | node19 32762 7280 6000 1861 1026 4 4 0.22221 0.55179 |
638 | 49148d15 | Iustin Pop | node20 32762 13280 6000 1861 1089 3 5 0.40535 0.58565 |
639 | 49148d15 | Iustin Pop | |
640 | 49148d15 | Iustin Pop | Here we see, beside the step list, the initial and final cluster |
641 | 49148d15 | Iustin Pop | status, with the final one showing all nodes being N+1 compliant, and |
642 | 49148d15 | Iustin Pop | the command list to reach the final solution. In the initial listing, |
643 | 49148d15 | Iustin Pop | we see which nodes are not N+1 compliant. |
644 | 49148d15 | Iustin Pop | |
645 | 49148d15 | Iustin Pop | The algorithm is stable as long as each step above is fully completed, |
646 | 49148d15 | Iustin Pop | e.g. in step 8, both the migrate and the replace-disks are |
647 | 49148d15 | Iustin Pop | done. Otherwise, if only the migrate is done, the input data is |
648 | 49148d15 | Iustin Pop | changed in a way that the program will output a different solution |
649 | 49148d15 | Iustin Pop | list (but hopefully will end in the same state). |
650 | 49148d15 | Iustin Pop | |
651 | 49148d15 | Iustin Pop | SEE ALSO |
652 | 49148d15 | Iustin Pop | -------- |
653 | 49148d15 | Iustin Pop | |
654 | 49148d15 | Iustin Pop | **hspace**(1), **hscan**(1), **hail**(1), **ganeti**(7), |
655 | 49148d15 | Iustin Pop | **gnt-instance**(8), **gnt-node**(8) |
656 | 49148d15 | Iustin Pop | |
657 | 49148d15 | Iustin Pop | COPYRIGHT |
658 | 49148d15 | Iustin Pop | --------- |
659 | 49148d15 | Iustin Pop | |
660 | 26f7b098 | Iustin Pop | Copyright (C) 2009, 2010, 2011 Google Inc. Permission is granted to |
661 | 26f7b098 | Iustin Pop | copy, distribute and/or modify under the terms of the GNU General |
662 | 26f7b098 | Iustin Pop | Public License as published by the Free Software Foundation; either |
663 | 26f7b098 | Iustin Pop | version 2 of the License, or (at your option) any later version. |
664 | 49148d15 | Iustin Pop | |
665 | 49148d15 | Iustin Pop | On Debian systems, the complete text of the GNU General Public License |
666 | 49148d15 | Iustin Pop | can be found in /usr/share/common-licenses/GPL. |