Statistics
| Branch: | Tag: | Revision:

root / NEWS @ b1ddaad9

History | View | Annotate | Download (11.5 kB)

1
Ganeti-htools release notes
2
===========================
3

    
4
Version 0.2.5 (Mon, 26 Apr 2010)
5
--------------------------------
6

    
7
Some internal cleanup plus a few user-visible changes:
8

    
9
- new option for marking instances as 'do-not-move' during rebalancing
10
- allow ``hscan`` to scan the local cluster via Luxi
11
- add more metrics to ``hspace`` which show the delta between original
12
  state and final state better (only valid for tiered allocation)
13

    
14
Version 0.2.4 (Mon, 22 Feb 2010)
15
--------------------------------
16

    
17
Two improvements for node evacuation:
18

    
19
- hbal takes a new parameter ``--evac-mode`` that restricts the
20
  instances to be moved to the ones on offline/drained nodes, which
21
  should reduce the work done
22
- hail supports the new ``multi-evacuate`` mode of the IAllocator
23
  protocol, that will be released in a minor release on the Ganeti 2.1
24
  branch
25

    
26
Version 0.2.3 (Thu,  4 Feb 2010)
27
--------------------------------
28

    
29
A small release:
30

    
31
- Fixes selection of secondary node: previously, if the cluster had
32
  many N+1 failures, a N+1 failed node could be selected as secondary
33
  even if it did not have enough memory to allow the instance to be
34
  migrated/failed over to it; this is bad for automated tools, since
35
  we can get the cluster in an unhealthy state
36
- Switch the text backend to a single input file, that is generated
37
  now by hscan and shouldn't be generated manually via
38
  gnt-node/instance list anymore; this allows richer information to be
39
  kept in the file, and simplifies a little the internals of the text
40
  backend
41

    
42
Version 0.2.2 (Tue, 29 Dec 2009)
43
--------------------------------
44

    
45
Small release, 0.2.1 was broken and thus this was released earlier:
46

    
47
- Release 0.2.1 broke the LUXI backend due to a typo, fixed
48
- Added a live-test script that should catch errors like the above one
49
  in the future (needs a working, non-empty cluster)
50
- Changed RAPI and LUXI backends to treat drained nodes as offline,
51
  similar to the IAllocator backend change in 0.2.0 (which was wrongly
52
  marked as affecting all backends)
53
- Changed the metrics for offline instances and N1 score from percent to
54
  count, in order to increase the priority of evacuations
55
- Added a new metric (offline primary instances) which should fix the
56
  evacuation of a offline node in a 2-node cluster
57

    
58
Version 0.2.1 (Wed,  2 Dec 2009)
59
--------------------------------
60

    
61
- Added instance exclusion defined via instance tags
62
- Fixed the output of hspace to be again parseable from the shell
63

    
64
Version 0.2.0 (Tue, 10 Nov 2009)
65
--------------------------------
66

    
67
A significant release, with a few new major features:
68

    
69
- Added direct execution of the hbal solution when using the Luxi
70
  backend; the steps for each instance moves are submitted as a single
71
  jobs, and the different jobs are submitted as groups in order to
72
  parallelise the execution of moves
73
- Added support for balancing based on dynamic utilisation data for
74
  instances, fed in via a text file; by default, all instances are
75
  considered equal and this change also improves the equalisation of
76
  secondary instances per node
77
- Added support for tiered capacity calculation in hspace, where we
78
  start from a maximum instance spec and decrease the spec when we run
79
  out of resources; this should give a better measure of available
80
  capacity on 'fragmented' clusters; this is done separately from the
81
  current fixed-mode computation
82

    
83
Also there have been many minor improvements:
84

    
85
- Added option for showing instances (“--print-instances”), similar to
86
  the print nodes option
87
- Added support for customising the node list via an argument to the
88
  print nodes option in the form of a comma-separated list of field
89
  names; currently the field names are not documented, expecting further
90
  changes in a next release
91
- Enhanced the error reporting in the Luxi and Rapi backends
92
- Changed the handling of drained nodes, now being treated the same as
93
  offline nodes, for Ganeti 2.0.4+ compatibility
94
- A number of internal changes, simplifying code and merging some
95
  disparate functions
96
- Simplify the build system in relation to creation of archives
97

    
98
Version 0.1.8 (Tue, 29 Sep 2009)
99
--------------------------------
100

    
101
- Brown-paper-bag release fixing haddock issues
102

    
103
Version 0.1.7 (Mon, 28 Sep 2009)
104
--------------------------------
105

    
106
- Fixed a bug in the Luxi backend for big responses
107
- Fixed test suite exit code in presence of test failures
108
- Changed the migrate operation to run instead failover for instances
109
  which were marked as not running in the input data (this could have
110
  been changed since then, but it's better than today's always migrate)
111
- Added support for 'cheap' moves only (only migrate/failover) in
112
  balancing
113
- Added support for building without curl (thus no RAPI backend)
114

    
115
Version 0.1.6 (Wed, 19 Aug 2009)
116
--------------------------------
117

    
118
- Added support for Luxi (the native Ganeti protocol)
119
- Added support for simulated clusters (for hspace only)
120
- Added timeouts for the RAPI backend
121
- Fixed a few inconsistencies in the command line handling
122
- Fixed handling of errors while loading data
123
- The 'network' is a new dependency due to the Luxi addition
124

    
125
Version 0.1.5 (Thu, 09 Jul 2009)
126
--------------------------------
127

    
128
- Removed obsolete hn1 program; this allowed removal of a lot of
129
  supporting code
130
- Lots of changes in hspace: the output now is a shell fragment in order
131
  for script to source it or parse it easier; added failure reasons;
132
  optimised to use less memory for large clusters
133
- Optimized the scoring algorithm (used by all tools) so that now
134
  computations should be faster
135

    
136
Version 0.1.4 (Tue, 16 Jun 2009)
137
--------------------------------
138

    
139
- Added CPU count/ratio of virtual-to-physical CPUs to the cluster
140
  scoring methods; this means that now the balancer, the iallocator
141
  plugin and so on will try to keep the VCPU-to-PCPU ratio equal across
142
  the cluster
143
- Fixed some hscan bugs
144
- Fixed the way iallocator reads the total disk size (was broken and it
145
  was always falling back to summing the disk sizes)
146
- Internals: fixed most compile-time warnings
147

    
148
Version 0.1.3 (Fri, 05 Jun 2009)
149
--------------------------------
150

    
151
- Fix a bug in the ReplacePrimary instance moves, affecting most of the
152
  tools
153

    
154
Version 0.1.2 (Tue, 02 Jun 2009)
155
--------------------------------
156

    
157
- Add a new program, “hspace”, which computes the free space on a
158
  cluster (based on a given instance spec)
159
- Improvements in API docs and partially in the user docs
160
- Started adding unittests
161

    
162
Version 0.1.1 (Tue, 26 May 2009)
163
--------------------------------
164

    
165
- Add a new program, “hail”, which is an iallocator plugin and can
166
  allocate/relocate instances
167
- Experimental support for non-mirrored instances (hail supports them,
168
  hbal should no longer abort when it finds such instances and simply
169
  ignore them)
170
- The RAPI port and/or scheme can be overriden now, and even “file://”
171
  schemes can be used if the message body has been saved under the
172
  appropriate name
173
- Lots of code reorganization, esp. rewritten loading pipeline
174
- Better data checking and better error messages in case validation
175
  fails; tools now consider nodes with error in input data (‘?’ returned
176
  by ganeti) as offline
177
- Small enhancement to the makefile for simpler packaging
178

    
179
Version 0.1.0 (Tue, 19 May 2009)
180
--------------------------------
181

    
182
- Drop compatibility with Ganeti 1.2
183
- Add a new minimum score option (with a very low default), should help
184
  with very good clusters (but is still not optimal)
185
- Add a --quiet option to hbal
186
- Add support for reading offline nodes directly from the cluster
187

    
188
Version 0.0.8 (Tue, 21 Apr 2009)
189
--------------------------------
190

    
191
- hbal: prevent mismatches in wrong node names being passed to -O, by
192
  aborting in this case
193
- add the ability to write the commands (-C) to a script via (-C<file>),
194
  so that it can be later executed directly; this has also changed the
195
  commands to include the ncessary -f flags to skip confirmations
196
- add checks for extra argument in hbal and hn1, so that unintended
197
  errors are catched
198
- raise the accepted “missing” memory limit to 512MB, to cover usual Xen
199
  reservations
200

    
201
Version 0.0.7 (Mon, 23 Mar 2009)
202
--------------------------------
203

    
204
- added support for offline nodes, which are not used as targets for
205
  instance relocation and if they hold instances the hbal algorithm will
206
  attempt to relocate these away
207
- added support for offline instances, which now will no longer skew the
208
  free memory estimation of nodes; the algorithm will no longer create
209
  conditions for N+1 failures when such instances are later started
210
- implemented a complete model of node resources, in order to prevent an
211
  unintended re-occurrence of cases like the offline instance were we
212
  miscalculate some node resource; this gives warning now in case the
213
  node reported free disk or free memory deviates by more than a set
214
  amount from the expected value
215
- a new tool *hscan* that can generate the input text-file for the other
216
  tools by collection via RAPI
217
- some small changes to the build system to make it more friendly; also
218
  included the generated documentation in the source archive
219

    
220
Version 0.0.6 (Mon, 16 Mar 2009)
221
--------------------------------
222

    
223
- re-factored the hbal algorithm to make it stable in the sense that it
224
  gives the same solution when restarted from the middle; barring
225
  rounding of disk/memory and incomplete reporting from Ganeti (for
226
  1.2), it should be now feasible to rely on its output without
227
  generating moves ad infinitum
228
- the hbal algorithm now uses two more variables: the node N+1 failures
229
  and the amount of reserved memory; the first of which tries to ‘fix’
230
  the N+1 status, the latter tries to distribute secondaries more
231
  equally
232
- the hbal algorithm now uses two more moves at each step:
233
  replace+failover and failover+replace (besides the original failover,
234
  replace, and failover+replace+failover)
235
- slightly changed the build system to embed GIT version/tags into the
236
  binaries so that we know for a binary from which tree it was done,
237
  either via ‘--version’ or via “strings hbal|grep version”
238
- changed the solution list and in general the hbal output to be more
239
  clear by default, and changed “gnt-instance failover” to “gnt-instance
240
  migrate”
241
- added man pages for the two binaries
242

    
243
Version 0.0.5 (Mon, 09 Mar 2009)
244
--------------------------------
245

    
246
- a few small improvements for hbal (possibly undone by later changes),
247
  hbal is now quite faster
248
- fix documentation building
249
- allow hbal to work on non N+1 compliant clusters, but without
250
  guarantees that the end cluster will be compliant; in any case, this
251
  should give a smaller number of nodes that are not compliant if the
252
  cluster state permits it
253
- strip common domain suffix from nodes and instances, so that output is
254
  shorter and hopefully clearer
255

    
256
Version 0.0.4 (Sun, 15 Feb 2009)
257
--------------------------------
258

    
259
- better balancing algorithm in hbal
260
- implemented an RAPI collector, now the cluster data can be gathered
261
  automatically via RAPI and doesn't need manual export of node and
262
  instance list
263

    
264
Version 0.0.3 (Wed, 28 Jan 2009)
265
--------------------------------
266

    
267
- initial release of the hbal, a cluster rebalancing tool
268
- input data format changed due to hbal requirements
269

    
270
Version 0.0.2 (Tue, 06 Jan 2009)
271
--------------------------------
272

    
273
- fix handling of some common cases (cluster N+1 compliant from the
274
  start, too big depth given, failure to compute solution)
275
- add option to print the needed command list for reaching the proposed
276
  solution
277

    
278
Version 0.0.1 (Tue, 06 Jan 2009)
279
--------------------------------
280

    
281
- initial release of hn1 tool
282

    
283
.. vim: set textwidth=72 :
284
.. Local Variables:
285
.. mode: rst
286
.. fill-column: 72
287
.. End: