Iustin Pop [Sun, 14 Dec 2008 12:02:53 +0000 (12:02 +0000)]
cleanup: LURenameCluster wrong variable name
Reviewed-by: amishchenko
Iustin Pop [Sun, 14 Dec 2008 12:02:45 +0000 (12:02 +0000)]
cleanup: fix export NIC count the same way as disk
For safety, we use the same algorithm as in disk count.
Reviewed-by: amishchenko
Iustin Pop [Sun, 14 Dec 2008 12:02:36 +0000 (12:02 +0000)]
cleanup: fix backend._RecursiveFindBD
_RecursiveFindBD takes a parameter that isn't used; moreover, nowhere in
the SVN history can I find a case where it has been used.
As such, remove this parameter and fix its callers.
Reviewed-by: amishchenko
Iustin Pop [Sun, 14 Dec 2008 12:02:27 +0000 (12:02 +0000)]
cleanup: more unused vars
Reviewed-by: amishchenko
Iustin Pop [Sun, 14 Dec 2008 12:02:18 +0000 (12:02 +0000)]
cleanup: sanitize a default parameter
Instead of relying on the parameter's usage being safe with a mutable
default value, let's just make it safer.
Reviewed-by: amishchenko
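A minimal sketch of the pitfall behind the default-parameter cleanup above, in modern Python. The function names are hypothetical and are not taken from the Ganeti code:

```python
def append_shared(item, bucket=[]):
    # The default list is created once, when the function is defined,
    # so every call that omits `bucket` mutates the same object.
    bucket.append(item)
    return bucket


def append_safe(item, bucket=None):
    # None acts as a sentinel; a fresh list is created on each call.
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket


shared_first = append_shared("a")
shared_second = append_shared("b")  # same list object as shared_first
safe_first = append_safe("a")
safe_second = append_safe("b")      # independent list
```

The first variant only works as long as no caller ever mutates the default; the second does not depend on how the parameter is used.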
Iustin Pop [Sun, 14 Dec 2008 12:02:09 +0000 (12:02 +0000)]
cleanup: exceptions should derive from Exception
Reviewed-by: amishchenko
Iustin Pop [Sun, 14 Dec 2008 12:02:01 +0000 (12:02 +0000)]
cleanup: fix GatherMasterVotes
Remove unused vars
Reviewed-by: amishchenko
Iustin Pop [Sun, 14 Dec 2008 12:01:52 +0000 (12:01 +0000)]
cleanup: _InitSSHSetup doesn't need its argument
Reviewed-by: imsnah
Iustin Pop [Sun, 14 Dec 2008 12:01:41 +0000 (12:01 +0000)]
cleanup: fix 'variable unused' warning
In the iteration we don't care about the node names, so we change the
for loop to be over the values (and not itervalues).
Reviewed-by: amishchenko
Michael Hanselmann [Fri, 12 Dec 2008 16:50:41 +0000 (16:50 +0000)]
ganeti.http: Rename HttpBase._using_ssl to HttpBase.using_ssl
It'll be queried from other classes.
Reviewed-by: iustinp
Michael Hanselmann [Fri, 12 Dec 2008 16:50:27 +0000 (16:50 +0000)]
ganeti.http: Rename HttpSocketBase to HttpBase
It's more appropriate.
Reviewed-by: iustinp
Iustin Pop [Thu, 11 Dec 2008 17:13:30 +0000 (17:13 +0000)]
Fix epydoc format warnings
This patch should fix all outstanding epydoc parsing errors; as such, we
switch epydoc into verbose mode so that any new errors will be visible.
Reviewed-by: imsnah
Iustin Pop [Thu, 11 Dec 2008 14:58:01 +0000 (14:58 +0000)]
Switch epydoc to parse only
epydoc seems to be mightily confused by decorators and how they change
functions (it starts mixing the parameters of the decorated function
into the decorator itself); so we want it to parse only and not look at
the objects themselves.
Reviewed-by: ultrotter
Michael Hanselmann [Wed, 10 Dec 2008 12:11:47 +0000 (12:11 +0000)]
ganeti.backend: Improve compression check
Reviewed-by: iustinp
Michael Hanselmann [Wed, 10 Dec 2008 12:06:35 +0000 (12:06 +0000)]
ganeti.http: Docstring updates
Reviewed-by: iustinp
Michael Hanselmann [Tue, 9 Dec 2008 18:42:53 +0000 (18:42 +0000)]
ganeti.http: Remove _HttpClientError
This is a leftover from old code.
Reviewed-by: iustinp
Michael Hanselmann [Tue, 9 Dec 2008 17:35:53 +0000 (17:35 +0000)]
ganeti.http.server: Increase connection backlog to 1024
This solves a problem with many concurrent requests. By default, 1024
is the maximum backlog on Linux kernels. We limit the number of clients
through MAX_CHILDREN, too. The idea of just increasing the backlog is
taken from lighttpd.
Reviewed-by: amishchenko
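As an illustration of the backlog change above, a sketch using the standard socket module. The helper name and its defaults are hypothetical, and the kernel may silently cap the requested backlog to its own limit:

```python
import socket

BACKLOG = 1024  # value named in the commit message


def make_listener(host="127.0.0.1", port=0, backlog=BACKLOG):
    """Create a listening TCP socket with a large connection backlog."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind((host, port))  # port 0 lets the OS pick a free port
    sock.listen(backlog)     # pending connections queue up to `backlog`
    return sock
```

The backlog only bounds connections waiting to be accepted; limiting the number of clients being served is a separate concern, which the commit handles through MAX_CHILDREN.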
Michael Hanselmann [Tue, 9 Dec 2008 13:24:26 +0000 (13:24 +0000)]
RPC: Compress file upload data
Adding compression to larger amounts of data is more efficient than
transferring it (len(nodes) - 1) times over the network without
compression. We were able to compress an 800 KB config file to about
30 KB, which is about 40 KB with Base64 encoding (required due to
the way SimpleJson handles strings).
Reviewed-by: ultrotter
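A rough sketch of the compress-then-encode step described above. The commit does not name the compression algorithm, so zlib is an assumption here, and both helper names are hypothetical:

```python
import base64
import zlib


def encode_payload(data: bytes) -> str:
    """Compress the data, then Base64-encode it so it is a plain ASCII string."""
    return base64.b64encode(zlib.compress(data)).decode("ascii")


def decode_payload(text: str) -> bytes:
    """Reverse encode_payload on the receiving side."""
    return zlib.decompress(base64.b64decode(text))


# Repetitive text, standing in for a large config file.
sample = b"node.example.com: online\n" * 1000
wire = encode_payload(sample)
restored = decode_payload(wire)
```

Base64 emits four output characters for every three input bytes, which matches the growth from about 30 KB to about 40 KB quoted in the commit message.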
Iustin Pop [Tue, 9 Dec 2008 09:33:32 +0000 (09:33 +0000)]
Warn for instances living on offline nodes
The patch also changes the result to error for non-reachable secondary nodes
(as for primary nodes).
Reviewed-by: ultrotter
Iustin Pop [Mon, 8 Dec 2008 17:45:56 +0000 (17:45 +0000)]
Fix _AdjustCandidatePool
Currently the ConfigWriter.MaintainCandidatePool returns node names, and
_AdjustCandidatePool uses them as such, but then it passes these to
context.ReaddNode which in turn passes them to jqueue.JobQueue.AddNode which
uses them as objects.Node instances.
Since this is currently the only usage, we change return type from
ConfigWriter.MaintainCandidatePool to be objects and adjust the logging of
their names, so that the auto-adjustment works.
Reviewed-by: ultrotter
Iustin Pop [Mon, 8 Dec 2008 11:46:51 +0000 (11:46 +0000)]
gnt-node modify: add the offline attribute
This patch changes gnt-node modify and the associated opcode/lu to allow
modification of the node offline attribute.
Setting a node into offline mode automatically demotes it from the
master role.
Reviewed-by: ultrotter
Iustin Pop [Mon, 8 Dec 2008 09:10:48 +0000 (09:10 +0000)]
RPC: do not make calls to offline nodes
This patch changes the _MultNodeCall and _SingleNodeCall helpers to not
actually make calls to offline nodes, but instead generate fake
responses which have a parameter called 'offline' set so that callers
can check for this value if they want (otherwise, it's just a failed RPC
call).
Reviewed-by: ultrotter
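The idea of answering for offline nodes without contacting them could be sketched as follows. This is not the Ganeti implementation: the function name, the dictionary layout and the `do_call` callback are all assumptions made for illustration:

```python
def multi_node_call(nodes, offline_nodes, do_call):
    """Call do_call(node) for online nodes; fake a failed result for offline ones."""
    results = {}
    for name in nodes:
        if name in offline_nodes:
            # No network traffic at all: the result looks like a failed
            # RPC, with an extra flag that interested callers can check.
            results[name] = {"failed": True, "offline": True, "data": None}
        else:
            results[name] = {"failed": False, "offline": False,
                             "data": do_call(name)}
    return results


contacted = []


def fake_call(name):
    # Stand-in for a real RPC; records which nodes were contacted.
    contacted.append(name)
    return "pong from %s" % name


answers = multi_node_call(["node1", "node2"], {"node2"}, fake_call)
```

Callers that ignore the flag simply see a failed call, which is the fallback behaviour the commit message describes.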
Guido Trotter [Sun, 7 Dec 2008 11:01:55 +0000 (11:01 +0000)]
chmod ganeti.initd before uploading it
When an upload is done to a node which doesn't have any version of
ganeti installed, this prevents a non-executable-initd error later in
the upload.
Reviewed-by: imsnah
Iustin Pop [Fri, 5 Dec 2008 11:41:33 +0000 (11:41 +0000)]
Make cluster verify understand offline nodes
This patch changes cluster verify to not alert on offline nodes, but
instead just show a note at the end with the number of such nodes.
It also removes warnings in verify-disks and hooks about failures to
make rpc calls to such nodes.
Reviewed-by: ultrotter
Iustin Pop [Fri, 5 Dec 2008 11:32:02 +0000 (11:32 +0000)]
cmdlib: check node stats in prereqs
This patch adds checks for offline nodes in most instance LUs so that we
can work with offline secondaries, but not with offline primaries. Some
cases (like grow disk, which needs both sides up) are not allowing
offline nodes at all.
Reviewed-by: ultrotter
Iustin Pop [Fri, 5 Dec 2008 11:20:20 +0000 (11:20 +0000)]
Add two utility functions to cmdlib
These will be used for parameter checking and node status checking.
Reviewed-by: ultrotter
Iustin Pop [Fri, 5 Dec 2008 11:14:19 +0000 (11:14 +0000)]
Add function to compute the master candidates
Since some nodes can be offline, we can't just take the length of the
node list as the maximum possible number of master candidates.
The patch adds a utility function to correctly compute this value and
replaces hardcoded computations with the use of this function. It then
adds utility functions to automate the maintenance of the node lists.
Reviewed-by: ultrotter
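A small sketch of the computation motivated above, assuming nodes are represented as dictionaries with an "offline" key. Both function names are hypothetical, and capping the target at the configured pool size is an assumption rather than something the commit states:

```python
def max_possible_candidates(nodes):
    """Offline nodes cannot be candidates, so only count the others."""
    return sum(1 for node in nodes if not node["offline"])


def candidate_target(nodes, pool_size):
    """Assumed policy: aim for the pool size, capped at what is possible."""
    return min(pool_size, max_possible_candidates(nodes))


cluster = [
    {"name": "node1", "offline": False},
    {"name": "node2", "offline": True},
    {"name": "node3", "offline": False},
]
```

Using `len(cluster)` here would report three possible candidates even though only two nodes can take the role.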
Iustin Pop [Fri, 5 Dec 2008 10:12:58 +0000 (10:12 +0000)]
http: use slicing instead of string modification
The combination of the current buffer splitting method and (4KB) buffer
size is very inefficient when writing big amounts of data. Just walking
over a 16 megabyte string using a 4K buffer takes (on a random computer)
1m06s, whereas using slices will decrease this to 0.080s, and slicing
with 32 KB size decreases this to 0.073s.
This means that uploading a big config file (it nears 1MB for big
clusters) will take more and more time per the number of nodes, since it
needs lots of slicing.
I happened upon this by accidentally setting all nodes as master
candidates, at which point just uploading the config file to all nodes
took 40s. Applying the patch decreases this to 15s (this probably can
still be optimized).
The patch also removes a duplicate constant (the one actually used is in
http/client.py), and changes the receive buffer size to use the same
constant.
Reviewed-by: imsnah
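The difference between the two buffer-walking strategies discussed above can be sketched like this. The function names are hypothetical and the code only illustrates the technique, not the actual http module:

```python
def chunks_by_truncation(data, size):
    """Old style: cut the sent prefix off the buffer after every chunk.

    Each `data[size:]` copies the whole remainder, so the total work
    grows quadratically with the length of the data.
    """
    chunks = []
    while data:
        chunks.append(data[:size])
        data = data[size:]
    return chunks


def chunks_by_slicing(data, size):
    """New style: keep the buffer intact and slice at a moving offset."""
    return [data[pos:pos + size] for pos in range(0, len(data), size)]


payload = b"x" * 10000
old_chunks = chunks_by_truncation(payload, 4096)
new_chunks = chunks_by_slicing(payload, 4096)
```

Both functions yield the same chunks; only the second avoids re-copying the unsent tail on every iteration.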
Iustin Pop [Fri, 5 Dec 2008 10:12:45 +0000 (10:12 +0000)]
Add the offline node list to ssconf
The patch also changes the various node list generation to be more
consistent.
Reviewed-by: imsnah
Iustin Pop [Fri, 5 Dec 2008 03:01:21 +0000 (03:01 +0000)]
Cleanup the config file on demotion from candidate
This patch adds a simple rpc which makes a backup of the config file and
then removes it. This is done so that cluster verify doesn't complain
immediately after demoting a node.
Reviewed-by: imsnah
Iustin Pop [Fri, 5 Dec 2008 02:58:40 +0000 (02:58 +0000)]
watcher: handle offline nodes better
This patch changes the LUQueryInstances to show a different state for
offline nodes and also modifies the watcher to understand the offline
state in its checks.
Reviewed-by: ultrotter
Iustin Pop [Fri, 5 Dec 2008 02:53:33 +0000 (02:53 +0000)]
node list: add the offline field
Reviewed-by: ultrotter
Iustin Pop [Fri, 5 Dec 2008 02:53:21 +0000 (02:53 +0000)]
Add a new node parameter 'offline'
This patch adds a new node parameter called offline that will be used to
mark nodes which should not be touched by commands.
We also add this flag at cluster init, node add, and export it to
iallocator scripts.
Reviewed-by: ultrotter
Iustin Pop [Fri, 5 Dec 2008 02:42:18 +0000 (02:42 +0000)]
ssconf: empty files should not add a newline
Currently we add a newline in the ssconf writeout process, even if the
file is empty. We change this case so that lists of values (e.g. offline
nodes) are correct (not a list of one empty element).
Reviewed-by: imsnah
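To illustrate the empty-file case above, a sketch assuming the reader splits the file contents into lines with `splitlines()`; that reader, and both function names, are assumptions rather than the actual ssconf code:

```python
def render_always_newline(values):
    """Old behaviour: a newline is appended even when there are no values."""
    return "\n".join(values) + "\n"


def render_skip_empty(values):
    """New behaviour: an empty list produces an empty file."""
    text = "\n".join(values)
    if text:
        text += "\n"
    return text


def parse(text):
    """Assumed reader: one list element per line."""
    return text.splitlines()


old_empty = parse(render_always_newline([]))
new_empty = parse(render_skip_empty([]))
round_trip = parse(render_skip_empty(["node1", "node2"]))
```

With the old writer an empty offline-node list reads back as a list holding one empty string, which is the incorrect result the commit message mentions.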
Michael Hanselmann [Thu, 4 Dec 2008 15:25:26 +0000 (15:25 +0000)]
ganeti.http: Add constant for DELETE
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:25:12 +0000 (15:25 +0000)]
Remove old HTTP code
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:24:52 +0000 (15:24 +0000)]
ganeti.rpc: Convert to new HTTP server
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:24:14 +0000 (15:24 +0000)]
ganeti-rapi: Convert to new HTTP server
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:23:50 +0000 (15:23 +0000)]
ganeti-noded: Migrate to new HTTP server
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:23:38 +0000 (15:23 +0000)]
ganeti.http: Split HTTP server and client into separate files
This includes a large rewrite of the HTTP server code. The handling of
OpenSSL errors had some problems that were hard to fix with its
structure. When preparing all of this, I realized that actually HTTP
is a message protocol and that the same code can be used on both the
server and client side to parse requests/responses, with only a few
differences. There are still a few TODOs in the code, but none should
be a show stopper. Many pylint warnings have been fixed, too.
The old code will be removed once all users have been migrated.
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:23:17 +0000 (15:23 +0000)]
Rename all HTTP classes to camel case
It should be consistent.
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:22:54 +0000 (15:22 +0000)]
ganeti.http: Remove underline from two classes
This is a preparation step for splitting the HTTP client and server code
into two separate modules.
Reviewed-by: amishchenko
Michael Hanselmann [Thu, 4 Dec 2008 15:22:41 +0000 (15:22 +0000)]
Move HTTP code to subpackage
This is a preparation step for splitting the HTTP client and server code
into two separate modules.
Reviewed-by: amishchenko
Guido Trotter [Thu, 4 Dec 2008 14:52:40 +0000 (14:52 +0000)]
LURemoveNode, promote nodes to master candidates
If after the remove node there are not enough master candidates, we'll
try to promote them.
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 17:23:22 +0000 (17:23 +0000)]
LUQueryExports: fix rpcresult handling
call_export_list is a multi node call, so we need to go through the
results, extrapolate the good ones, and return a failure value for the
bad ones.
Reviewed-by: imsnah
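The per-node handling described above might look roughly like the following. The input layout (node name mapped to a `(failed, data)` pair) and the use of `False` as the failure value are assumptions for illustration only:

```python
def collect_export_lists(results):
    """Keep the payload of good results and mark bad ones with False."""
    collected = {}
    for node, (failed, data) in results.items():
        collected[node] = False if failed else data
    return collected


raw_results = {
    "node1": (False, ["instance1", "instance2"]),
    "node2": (True, None),   # the call to this node failed
    "node3": (False, []),    # reachable, but nothing exported
}
exports = collect_export_lists(raw_results)
```

Keeping a distinct failure value lets callers tell an unreachable node apart from one that simply has no exports.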
Guido Trotter [Wed, 3 Dec 2008 17:23:08 +0000 (17:23 +0000)]
LUAddNode: Auto-make master candidates
When a node is added, if there are not enough master candidates, we'll
automatically promote it.
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 17:22:53 +0000 (17:22 +0000)]
LUAddNode: Check the correct result
This is a typo in the conversion to RpcResult
Reviewed-by: imsnah
Michael Hanselmann [Wed, 3 Dec 2008 16:09:46 +0000 (16:09 +0000)]
ganeti.http: Fix copyright header
Reviewed-by: ultrotter
Michael Hanselmann [Wed, 3 Dec 2008 16:09:34 +0000 (16:09 +0000)]
ganeti.http: Remove unused attribute "should_fork"
This is a leftover from removed code.
Reviewed-by: ultrotter
Michael Hanselmann [Wed, 3 Dec 2008 16:09:20 +0000 (16:09 +0000)]
ganeti.http: Move request handling logic from server to handler class
Reviewed-by: ultrotter
Michael Hanselmann [Wed, 3 Dec 2008 16:09:04 +0000 (16:09 +0000)]
ganeti.http: Move _SocketOperation to module-level function
This is a preparation step to move the HTTP server class to the
same model as the HTTP client (polling, non-blocking I/O, better
OpenSSL error handling).
Reviewed-by: ultrotter
Michael Hanselmann [Wed, 3 Dec 2008 16:08:51 +0000 (16:08 +0000)]
ganeti.http: Move _WaitForCondition into module-level function
Reviewed-by: ultrotter
Michael Hanselmann [Wed, 3 Dec 2008 16:08:25 +0000 (16:08 +0000)]
ganeti.http: Remove ApacheLogfile class
We don't need it anymore and it wouldn't work as it is, anyway.
Reviewed-by: ultrotter
Guido Trotter [Wed, 3 Dec 2008 11:12:55 +0000 (11:12 +0000)]
InitCluster force a config file update
After the cluster is ready we'll load the ConfigWriter and force a
writeout of all config files.
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 11:12:43 +0000 (11:12 +0000)]
Make sure the initial node is a master candidate
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 11:12:30 +0000 (11:12 +0000)]
gnt-cluster init, handle candidate_pool_size
- Add a new command line option, defaulting to the constant value
- Pass the value to bootstrap.InitCluster
- Use it to init the new Cluster object
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 11:12:16 +0000 (11:12 +0000)]
Add the MASTER_POOL_SIZE_DEFAULT constant
This constant will be used at cluster init time.
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 11:12:04 +0000 (11:12 +0000)]
CheckBEParams handle a bool BE_AUTO_BALANCE
This only happens at cluster init, if the value is not user-specified.
Reviewed-by: imsnah
Guido Trotter [Wed, 3 Dec 2008 10:28:50 +0000 (10:28 +0000)]
Extract the ListNodes headers and use them in help
Currently we have to update both the ListNodes headers and the online
help for the full field list. This patch uses the headers keys for the
help, thus removing the duplicate place to update and adding hope that
we'll have things in sync. As a downside we lose ordering of the
non-default fields in the online help.
Reviewed-by: imsnah
Iustin Pop [Wed, 3 Dec 2008 09:57:22 +0000 (09:57 +0000)]
A few fixes related to master candidates
This patch:
- fixes cluster verify when all nodes are master candidates, but the
candidate_pool_size is higher
- warn when the master node is not marked as candidate
- disable setting master node to regular node
- don't pass the master node to context.ReaddNode since the job queue
doesn't like getting our own node name
Reviewed-by: ultrotter
Iustin Pop [Wed, 3 Dec 2008 09:55:59 +0000 (09:55 +0000)]
Fix cluster rename and known_hosts
This patch rewrites and distributes ganeti's known_hosts file in case of
a cluster rename.
We also fix a problem in the node add (from where I copied the
known_hosts file distribution).
Reviewed-by: ultrotter
Guido Trotter [Tue, 2 Dec 2008 14:49:21 +0000 (14:49 +0000)]
Fix hooks_unittest with new rpc call structure
Reviewed-by: iustinp
Iustin Pop [Tue, 2 Dec 2008 14:35:02 +0000 (14:35 +0000)]
Fix gnt-cluster verify w.r.t. rpc changes
This partially reorganizes the cluster verify LU:
- introduce constants for the node verify rpc call
- move from additional rpc calls to a single rpc call, the
call_node_info, which gathers all data needed
Also fix a small error (self.LogWarning instead of self.Warning).
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 12:58:42 +0000 (12:58 +0000)]
Fix cluster rename
With the recent configwriter/ssconf changes, cluster rename becomes
trivial. This patch gets rid of the code and just updates the cluster
object.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 12:58:31 +0000 (12:58 +0000)]
Convert rpc results to a custom type
For a long time we had the problem that both RPC-layer errors and
results from the remote node share the same "valuespace". This is
because we shouldn't raise an exception when only one node failed
(and lose the results from the other nodes).
This patch attempts to address this problem by returning a special
object from RPC calls, which separates the rpc-layer status and the
remote results into different attributes.
All the users of rpc (mainly cmdlib, but also bootstrap and the
HooksMaster in mcpu) have been converted to this new model. The code has
changed from, e.g. for boolean return types:
    if not self.rpc.call_...
to
    result = self.rpc.call_...
    if result.failed or not result.data:
       ^ rpc-layer error    ^ result payload
While this is slightly more complicated, it will allow cleaner checks in
the future; right now the code is just a plain port, without
optimizations.
There's also a "result.Raise()" which raises an OpExecError if the
rpc-layer had errors.
One side-effect of the patch is that now all return types from the
rpc.call_ functions are of either RpcResult (single-node) or dicts of
(node name, RpcResult); previously, some functions were returning
different object types based on error status.
The code passes burnin (after many retries :).
Reviewed-by: imsnah
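A minimal sketch of a result object in the spirit of the description above. Only the names `failed`, `data`, `Raise` and `OpExecError` come from the commit message; the constructor, the `node` attribute and the error text are assumptions, and this is not the actual Ganeti class:

```python
class OpExecError(Exception):
    """Stand-in for the error type named in the commit message."""


class RpcResult:
    """Keeps the rpc-layer status apart from the remote payload."""

    def __init__(self, data=None, failed=False, node=None):
        self.data = data      # what the remote node returned
        self.failed = failed  # True if the RPC layer itself had an error
        self.node = node      # hypothetical: which node was called

    def Raise(self):
        """Raise OpExecError if the rpc layer reported an error."""
        if self.failed:
            raise OpExecError("RPC call to %s failed" % self.node)


good = RpcResult(data=True, node="node1")
bad = RpcResult(failed=True, node="node2")

# The check pattern from the commit message, for a boolean payload:
good_is_error = good.failed or not good.data
bad_is_error = bad.failed or not bad.data
```

Because every call returns the same type, a multi-node call can hand back a dictionary of such objects without losing the good results when one node fails.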
Iustin Pop [Tue, 2 Dec 2008 12:58:15 +0000 (12:58 +0000)]
burnin: add instance reinstall and reboot
These two operations were missing from burnin. The reboot is done with
all valid modes (a new constant is added), and the reinstall is done
both with and without specifying the OS (to account for the two code
paths in the LU).
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 12:58:00 +0000 (12:58 +0000)]
burnin: don't do export/import for file storage
This is currently not supported, so don't try to do export/import in
this case.
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:54:37 +0000 (10:54 +0000)]
KVMHypervisor add two missing 'constants.'
Some calls to the HV parameters were missing them.
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:54:21 +0000 (10:54 +0000)]
KVMHypervisor fix to case misspellings
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:54:03 +0000 (10:54 +0000)]
cluster init: don't discard the hypervisor
On cluster init if the user specifies a default hypervisor (with -t)
which is not in the default list of enabled hypervisors (currently just
xen-pvm) without explicitly specifying the list we silently override
the choice.
With this patch we set the list by default to just the required one, and
we bail out should the list be hand-specified and not contain the
default one. This still has an issue when the user doesn't specify a
default hypervisor but specifies a list which doesn't include xen-pvm:
in this case though we give an error, rather than silently discarding
choices.
Reviewed-by: imsnah
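The selection rule described above can be sketched as follows. The function name and the use of `ValueError` are hypothetical; the real code lives in the cluster init path and may differ:

```python
def resolve_enabled_hypervisors(default_hv, enabled=None):
    """Return the enabled hypervisor list, refusing to drop the default."""
    if enabled is None:
        # No list given: enable just the requested default hypervisor.
        return [default_hv]
    if default_hv not in enabled:
        # A hand-specified list must contain the default hypervisor.
        raise ValueError("default hypervisor %s is not in the enabled list"
                         % default_hv)
    return list(enabled)


only_default = resolve_enabled_hypervisors("kvm")
explicit = resolve_enabled_hypervisors("kvm", ["xen-pvm", "kvm"])
```

Failing loudly on a mismatch is the point of the change: the user's choice is either honoured or rejected, never silently replaced.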
Guido Trotter [Tue, 2 Dec 2008 10:53:25 +0000 (10:53 +0000)]
Use the new utils.CheckBEParams function
Where we used/forgot to validate beparams we now use the new common function.
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:53:14 +0000 (10:53 +0000)]
Add utils.CheckBEParams
This function will be used in LUCreateInstance, LUSetInstanceParams,
LUSetClusterParams and InitCluster to check the backend parameters
validity and convert the relevant values to integer, without duplicating
code. It lives in utils as bootstrap.py is calling it too.
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:53:01 +0000 (10:53 +0000)]
Add constants.VALUE_TRUE and VALUE_FALSE
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:25:48 +0000 (10:25 +0000)]
Handle default/none values in hv/be params
When a value is set to constants.VALUE_DEFAULT we have to remove it from
the specific instance dict, so that it will be populated from the
cluster defaults. If instead it's specified as constants.VALUE_NONE we'll
explicitly set it to None, to override a different value present in
those defaults. However, currently, we handle None values only
for hvparams, which have a real use case for them.
Reviewed-by: imsnah
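A sketch of the default/none handling described above. The constant values, the function names and the parameter keys are assumptions chosen for illustration; only the names VALUE_DEFAULT and VALUE_NONE appear in the commit message:

```python
VALUE_DEFAULT = "default"  # assumed value of constants.VALUE_DEFAULT
VALUE_NONE = "none"        # assumed value of constants.VALUE_NONE


def apply_param_changes(instance_params, changes):
    """Return a new per-instance dict with the requested changes applied."""
    updated = dict(instance_params)
    for key, value in changes.items():
        if value == VALUE_DEFAULT:
            # Drop the override so the cluster-level value applies again.
            updated.pop(key, None)
        elif value == VALUE_NONE:
            # Store an explicit None that masks the cluster-level value.
            updated[key] = None
        else:
            updated[key] = value
    return updated


def effective_params(cluster_defaults, instance_params):
    """Instance-level entries win over cluster defaults."""
    merged = dict(cluster_defaults)
    merged.update(instance_params)
    return merged


defaults = {"kernel_path": "/boot/vmlinuz", "memory": 128}
instance = {"kernel_path": "/boot/custom", "memory": 512}

changed = apply_param_changes(
    instance, {"memory": VALUE_DEFAULT, "kernel_path": VALUE_NONE})
final = effective_params(defaults, changed)
```

Removing the key and storing None are deliberately different: the first falls back to the cluster value, the second overrides it with nothing.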
Guido Trotter [Tue, 2 Dec 2008 10:25:34 +0000 (10:25 +0000)]
SetInstanceParams: handle default/none values
If the hv/be parameter lowercase value is set to "default" we'll pass
constants.VALUE_DEFAULT, if it's set to "none" we'll pass
constants.VALUE_NONE.
Reviewed-by: imsnah
Guido Trotter [Tue, 2 Dec 2008 10:19:43 +0000 (10:19 +0000)]
Update gnt-backup online help
--src-node and --src-dir are not mandatory anymore
Reviewed-by: iustinp
Guido Trotter [Tue, 2 Dec 2008 10:19:30 +0000 (10:19 +0000)]
ImportExport: make src_node and src_path optional
If src_node is not there we'll default to using the currently exported
instance name as src_path. Also, if src_path is not absolute we'll look
for it in EXPORT_DIR.
Reviewed-by: iustinp
Guido Trotter [Tue, 2 Dec 2008 10:19:16 +0000 (10:19 +0000)]
LUCreateInstance: handle import without src_node
If we get called with no source node we'll treat src_path as an
instance name exported in EXPORT_DIR in one of the nodes and look for
it with the export_list rpc call.
Reviewed-by: iustinp
Guido Trotter [Tue, 2 Dec 2008 10:19:02 +0000 (10:19 +0000)]
LUCreateInstance: keep src node lock on import
Currently the node lock also guards against removing the import at the
wrong time, so if we're importing an instance image we want to keep the
source node locked. In the future we might want to put export locks at a
different level than node locks.
Reviewed-by: iustinp
Iustin Pop [Tue, 2 Dec 2008 05:07:01 +0000 (05:07 +0000)]
Fix master failover
The ssconf files were not updated by the master failover. We need to
push them, and since we already have RPC initialized, we can use the
standard ConfigWriter to do so - this will take care of both the config
file and the ssconf files.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:06:47 +0000 (05:06 +0000)]
Adjust cluster-verify to check for candidate role
Currently cluster verify checks all nodes for the same set of files,
even if the nodes are not master candidates.
This patch adds back checking of ssconf files for consistency and splits
the checksum check into different error reporting messages based on
candidate role.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:06:34 +0000 (05:06 +0000)]
Add candidate pool size checks in verify
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:06:21 +0000 (05:06 +0000)]
Prevent demotion from candidate based on pool size
In gnt-cluster modify we prevent demotion from the candidate role if
there are not enough master candidates left.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:06:08 +0000 (05:06 +0000)]
Add cluster candidate pool size parameter
This patch adds a new cluster parameter "candidate_pool_size" which
tracks the desired size of the list of nodes with the master_candidate
flag set.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:05:53 +0000 (05:05 +0000)]
Prevent master failover to a non candidate node
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:05:40 +0000 (05:05 +0000)]
Add the list of master candidates to ssconf
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:05:25 +0000 (05:05 +0000)]
Restrict job propagation to master candidates only
This patch restricts the job propagation to master candidates only, by
not registering non-candidates in the job queue node lists.
Note that we do intentionally purge the job queue if a node is toggled
to non-master status.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:05:12 +0000 (05:05 +0000)]
Restrict config replication to master candidates
This patch restricts the config data replication to master candidates
only.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:05:01 +0000 (05:05 +0000)]
Add a gnt-node modify operation
This patch adds the OpCode, LogicalUnit and gnt-node command for
modifying node parameters, more specifically the master candidate flag
for a node.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:04:44 +0000 (05:04 +0000)]
Add master/master_candidate fields to node list
This patch adds listing of the master_candidate field (as Y/N) and of
the master role (again Y/N) for nodes.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:04:28 +0000 (05:04 +0000)]
Introduce a new 'master_candidate' node attribute
The field is not yet used.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:04:17 +0000 (05:04 +0000)]
Simplify a little the ssconf update
We have (again) the KeyToFilename function, so we move the writing of
the files to a method under SimpleStore.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:04:05 +0000 (05:04 +0000)]
Replicate the node list in ssconf
This patch adds node_list in the list of replicated values from
ConfigWriter.
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 05:03:52 +0000 (05:03 +0000)]
Revert "Get rid of ssconf"
This partially reverts the "Get rid of ssconf" patch.
It adds back a simpler version of the SimpleStore class, and drops the
WritableSimpleStore class. The new version of the class also has
node_list as a new key, and increases the size of the keys so that big
clusters will fit the node list. Also, the SS_* constants are moved to
constants.py, since the ConfigWriter class will need them too in order
to generate the values dictionary.
It also changes the GetMasterAndMyself function to use the SimpleStore
by default, and the backend._GetConfig to use it too (it has all the
needed keys).
Reviewed-by: imsnah
Iustin Pop [Tue, 2 Dec 2008 01:41:00 +0000 (01:41 +0000)]
burnin: fix usage of diskless template
This allows burnin to work with diskless instances (since right now we
cannot pass it an empty disk list).
Reviewed-by: imsnah
Michael Hanselmann [Mon, 1 Dec 2008 20:52:31 +0000 (20:52 +0000)]
Update QA scripts to new cluster parameters
There are still issues, especially with "gnt-instance modify" and
resetting values. However, this is a start.
Reviewed-by: ultrotter
Michael Hanselmann [Mon, 1 Dec 2008 20:52:17 +0000 (20:52 +0000)]
gnt-instance add: Remove "--os-size" and "--swap-size"
They're not used anymore.
Reviewed-by: ultrotter
Michael Hanselmann [Mon, 1 Dec 2008 20:52:03 +0000 (20:52 +0000)]
Fix RpcRunner._StaticSingleNodeCall
Unfortunately, a rpc.Client object was passed as the first parameter,
causing the function to always fail.
Found during QA testing.
Reviewed-by: ultrotter
Guido Trotter [Mon, 1 Dec 2008 15:47:12 +0000 (15:47 +0000)]
InitCluster: initialize master node serial_no
Previously it was left alone, and thus its value was "null".
Reviewed-by: imsnah
Iustin Pop [Mon, 1 Dec 2008 06:02:06 +0000 (06:02 +0000)]
Fix errors when the node info RPC is incomplete
[Forward-port from the 1.2 branch]
If ganeti starts before xend, the node information will not have all the
fields filled in. The patch changes so that missing keys will be treated
as unknown (this applies to other cases as well, not only xend not
started).
Reviewed-by: ultrotter