Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ 0a8bf8fb

History | View | Annotate | Download (69.1 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
More info about Pithos can be found here: https://code.grnet.gr/projects/pithos
23

    
24
Document Revisions
25
^^^^^^^^^^^^^^^^^^
26

    
27
=========================  ================================
28
Revision                   Description
29
=========================  ================================
30
0.8 (Jan 19, 2012)         Update allowed versioning values.
31
\                          Change policy/meta formatting in JSON/XML replies.
32
\                          Document that all non-ASCII characters in headers should be URL-encoded.
33
\                          Support metadata-based queries when listing objects at the container level.
34
\                          Note Content-Type issue when using the internal django web server.
35
\                          Add object UUID field.
36
\                          Always reply with the MD5 in the ETag.
37
\                          Note that ``/login`` will only work if an external authentication system is defined.
38
0.7 (Nov 21, 2011)         Suggest upload/download methods using hashmaps.
39
\                          Propose syncing algorithm.
40
\                          Support cross-account object copy and move.
41
\                          Pass token as a request parameter when using ``POST`` via an HTML form.
42
\                          Optionally use source account to update object from another object.
43
\                          Use container ``POST`` to upload missing blocks of data.
44
\                          Report policy in account headers.
45
\                          Add insufficient quota reply.
46
\                          Use special meta to always report Merkle hash.
47
0.6 (Sept 13, 2011)        Reply with Merkle hash as the ETag when updating objects.
48
\                          Include version id in object replace/change replies.
49
\                          Change conflict (409) replies format to text.
50
\                          Tags should be migrated to a meta value.
51
\                          Container ``PUT`` updates metadata/policy.
52
\                          Report allowed actions in shared object replies.
53
\                          Provide ``https://hostname/login`` for Shibboleth authentication.
54
\                          Use ``hashmap`` parameter in object ``GET``/``PUT`` to use hashmaps.
55
0.5 (July 22, 2011)        Object update from another object's data.
56
\                          Support object truncate.
57
\                          Create object using a standard HTML form.
58
\                          Purge container/object history.
59
\                          List other accounts that share objects with a user.
60
\                          List shared containers/objects.
61
\                          Update implementation guidelines.
62
\                          Check preconditions when creating/updating objects.
63
0.4 (July 01, 2011)        Object permissions and account groups.
64
\                          Control versioning behavior and container quotas with container policy directives.
65
\                          Support updating/deleting individual metadata with ``POST``.
66
\                          Create object using hashmap.
67
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
68
\                          Allow for publicly available objects via ``https://hostname/public``.
69
\                          Support time-variant account/container listings. 
70
\                          Add source version when duplicating with ``PUT``/``COPY``.
71
\                          Request version in object ``HEAD``/``GET`` requests (list versions with ``GET``).
72
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
73
\                          Include underlying storage characteristics in container meta.
74
\                          Support for partial object updates through ``POST``.
75
\                          Expose object hashmaps through ``GET``.
76
\                          Support for multi-range object ``GET`` requests.
77
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
78
=========================  ================================
79

    
80
Pithos Users and Authentication
81
-------------------------------
82

    
83
In Pithos, each user is uniquely identified by a token. All API requests require a token and each token is internally resolved to an account string. The API uses the account string to identify the user's own files, thus whether a request is local or cross-account.
84

    
85
Pithos does not keep a user database. For development and testing purposes, user identifiers and their corresponding tokens can be defined in the settings file. However, Pithos is designed with an external authentication service in mind. This service must handle the details of validating user credentials and communicate with Pithos via a middleware software component that, given a token, fills in the internal request account variable.
86

    
87
Client software using Pithos, if not already knowing a user's identifier and token, should forward to the ``/login`` URI. The Pithos server, depending on its configuration will redirect to the appropriate login page.
88

    
89
The login URI accepts the following parameters:
90

    
91
======================  =========================
92
Request Parameter Name  Value
93
======================  =========================
94
next                    The URI to redirect to when the process is finished
95
renew                   Force token renewal (no value parameter)
96
======================  =========================
97

    
98
When done with logging in, the service's login URI should redirect to the URI provided with ``next``, adding ``user`` and ``token`` parameters, which contain the account and token fields respectively.
99

    
100
A user management service that implements a login URI according to these conventions is Astakos (https://code.grnet.gr/projects/astakos), by GRNET.
101

    
102
The Pithos API
103
--------------
104

    
105
The URI requests supported by the Pithos API follow one of the following forms:
106

    
107
* Top level: ``https://hostname/v1/``
108
* Account level: ``https://hostname/v1/<account>``
109
* Container level: ``https://hostname/v1/<account>/<container>``
110
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
111

    
112
All requests must include an ``X-Auth-Token`` - as a header, or a parameter.
113

    
114
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
115

    
116
=========================  ================================
117
Return Code                Description
118
=========================  ================================
119
400 (Bad Request)          The request is invalid
120
401 (Unauthorized)         Missing or invalid token
121
403 (Forbidden)            Request not allowed
122
404 (Not Found)            The requested resource was not found
123
503 (Service Unavailable)  The request cannot be completed because of an internal error
124
=========================  ================================
125

    
126
Top Level
127
^^^^^^^^^
128

    
129
List of operations:
130

    
131
=========  ==================
132
Operation  Description
133
=========  ==================
134
GET        Authentication (for compatibility with the OOS API) or list allowed accounts
135
=========  ==================
136

    
137
GET
138
"""
139

    
140
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
141

    
142
================  =====================
143
Return Code       Description
144
================  =====================
145
204 (No Content)  The request succeeded
146
================  =====================
147

    
148
If an ``X-Auth-Token`` is already present, the operation will be interpreted as a request to list other accounts that share objects to the user.
149

    
150
======================  =========================
151
Request Parameter Name  Value
152
======================  =========================
153
limit                   The amount of results requested (default is 10000)
154
marker                  Return containers with name lexicographically after marker
155
format                  Optional extended reply type (can be ``json`` or ``xml``)
156
======================  =========================
157

    
158
The reply is a list of account names.
159
If a ``format=xml`` or ``format=json`` argument is given, extended information on the accounts will be returned, serialized in the chosen format.
160
For each account, the information will include the following (names will be in lower case and with hyphens replaced with underscores):
161

    
162
===========================  ============================
163
Name                         Description
164
===========================  ============================
165
name                         The name of the account
166
last_modified                The last account modification date (regardless of ``until``)
167
===========================  ============================
168

    
169
Example ``format=json`` reply:
170

    
171
::
172

    
173
  [{"name": "user", "last_modified": "2011-12-02T08:10:41.565891+00:00"}, ...]
174

    
175
Example ``format=xml`` reply:
176

    
177
::
178

    
179
  <?xml version="1.0" encoding="UTF-8"?>
180
  <accounts>
181
    <account>
182
      <name>user</name>
183
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
184
    </account>
185
    <account>...</account>
186
  </accounts>
187

    
188
===========================  =====================
189
Return Code                  Description
190
===========================  =====================
191
200 (OK)                     The request succeeded
192
204 (No Content)             The user has no access to other accounts (only for non-extended replies)
193
===========================  =====================
194

    
195
Will use a ``200`` return code if the reply is of type JSON/XML.
196

    
197
Account Level
198
^^^^^^^^^^^^^
199

    
200
List of operations:
201

    
202
=========  ==================
203
Operation  Description
204
=========  ==================
205
HEAD       Retrieve account metadata
206
GET        List containers
207
POST       Update account metadata
208
=========  ==================
209

    
210
HEAD
211
""""
212

    
213
====================  ===========================
214
Request Header Name   Value
215
====================  ===========================
216
If-Modified-Since     Retrieve if account has changed since provided timestamp
217
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
218
====================  ===========================
219

    
220
|
221

    
222
======================  ===================================
223
Request Parameter Name  Value
224
======================  ===================================
225
until                   Optional timestamp
226
======================  ===================================
227

    
228
Cross-user requests are not allowed to use ``until`` and only include the account modification date in the reply.
229

    
230
==========================  =====================
231
Reply Header Name           Value
232
==========================  =====================
233
X-Account-Container-Count   The total number of containers
234
X-Account-Bytes-Used        The total number of bytes stored
235
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
236
X-Account-Group-*           Optional user defined groups
237
X-Account-Policy-*          Account behavior and limits
238
X-Account-Meta-*            Optional user defined metadata
239
Last-Modified               The last account modification date (regardless of ``until``)
240
==========================  =====================
241

    
242
|
243

    
244
================  =====================
245
Return Code       Description
246
================  =====================
247
204 (No Content)  The request succeeded
248
================  =====================
249

    
250

    
251
GET
252
"""
253

    
254
====================  ===========================
255
Request Header Name   Value
256
====================  ===========================
257
If-Modified-Since     Retrieve if account has changed since provided timestamp
258
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
259
====================  ===========================
260

    
261
|
262

    
263
======================  =========================
264
Request Parameter Name  Value
265
======================  =========================
266
limit                   The amount of results requested (default is 10000)
267
marker                  Return containers with name lexicographically after marker
268
format                  Optional extended reply type (can be ``json`` or ``xml``)
269
shared                  Show only shared containers (no value parameter)
270
until                   Optional timestamp
271
======================  =========================
272

    
273
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
274
Cross-user requests are not allowed to use ``until`` and only include the account/container modification dates in the reply.
275

    
276
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
277
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
278

    
279
===========================  ============================
280
Name                         Description
281
===========================  ============================
282
name                         The name of the container
283
count                        The number of objects inside the container
284
bytes                        The total size of the objects inside the container
285
last_modified                The last container modification date (regardless of ``until``)
286
x_container_until_timestamp  The last container modification date until the timestamp provided
287
x_container_policy_*         Container behavior and limits
288
x_container_meta_*           Optional user defined metadata
289
===========================  ============================
290

    
291
Example ``format=json`` reply:
292

    
293
::
294

    
295
  [{"name": "pithos",
296
    "bytes": 62452,
297
    "count": 8374,
298
    "last_modified": "2011-12-02T08:10:41.565891+00:00",
299
    "x_container_policy": {"quota": "53687091200", "versioning": "auto"},
300
    "x_container_meta": {"a": "b", "1": "2"}}, ...]
301

    
302
Example ``format=xml`` reply:
303

    
304
::
305

    
306
  <?xml version="1.0" encoding="UTF-8"?>
307
  <account name="user">
308
    <container>
309
      <name>pithos</name>
310
      <bytes>62452</bytes>
311
      <count>8374</count>
312
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
313
      <x_container_policy>
314
        <key>quota</key><value>53687091200</value>
315
        <key>versioning</key><value>auto</value>
316
      </x_container_policy>
317
      <x_container_meta>
318
        <key>a</key><value>b</value>
319
        <key>1</key><value>2</value>
320
      </x_container_meta>
321
    </container>
322
    <container>...</container>
323
  </account>
324

    
325
For more examples of container details returned in JSON/XML formats refer to the OOS API documentation. In addition to the OOS API, Pithos returns all fields. Policy and metadata values are grouped and returned as key-value pairs.
326

    
327
===========================  =====================
328
Return Code                  Description
329
===========================  =====================
330
200 (OK)                     The request succeeded
331
204 (No Content)             The account has no containers (only for non-extended replies)
332
304 (Not Modified)           The account has not been modified
333
412 (Precondition Failed)    The condition set can not be satisfied
334
===========================  =====================
335

    
336
Will use a ``200`` return code if the reply is of type JSON/XML.
337

    
338

    
339
POST
340
""""
341

    
342
====================  ===========================
343
Request Header Name   Value
344
====================  ===========================
345
X-Account-Group-*     Optional user defined groups
346
X-Account-Meta-*      Optional user defined metadata
347
====================  ===========================
348

    
349
|
350

    
351
======================  ============================================
352
Request Parameter Name  Value
353
======================  ============================================
354
update                  Do not replace metadata/groups (no value parameter)
355
======================  ============================================
356

    
357
No reply content/headers.
358

    
359
The operation will overwrite all user defined metadata, except if ``update`` is defined.
360
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
361

    
362
================  ===============================
363
Return Code       Description
364
================  ===============================
365
202 (Accepted)    The request has been accepted
366
================  ===============================
367

    
368

    
369
Container Level
370
^^^^^^^^^^^^^^^
371

    
372
List of operations:
373

    
374
=========  ============================
375
Operation  Description
376
=========  ============================
377
HEAD       Retrieve container metadata
378
GET        List objects
379
PUT        Create/update container
380
POST       Update container metadata
381
DELETE     Delete container
382
=========  ============================
383

    
384

    
385
HEAD
386
""""
387

    
388
====================  ===========================
389
Request Header Name   Value
390
====================  ===========================
391
If-Modified-Since     Retrieve if container has changed since provided timestamp
392
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
393
====================  ===========================
394

    
395
|
396

    
397
======================  ===================================
398
Request Parameter Name  Value
399
======================  ===================================
400
until                   Optional timestamp
401
======================  ===================================
402

    
403
Cross-user requests are not allowed to use ``until`` and only include the container modification date in the reply.
404

    
405
===========================  ===============================
406
Reply Header Name            Value
407
===========================  ===============================
408
X-Container-Object-Count     The total number of objects in the container
409
X-Container-Bytes-Used       The total number of bytes of all objects stored
410
X-Container-Block-Size       The block size used by the storage backend
411
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
412
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
413
X-Container-Object-Meta      A list with all meta keys used by objects (**TBD**)
414
X-Container-Policy-*         Container behavior and limits
415
X-Container-Meta-*           Optional user defined metadata
416
Last-Modified                The last container modification date (regardless of ``until``)
417
===========================  ===============================
418

    
419
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix, formatted as a comma-separated list. See container ``PUT`` for a reference of policy directives. (**TBD**)
420

    
421
================  ===============================
422
Return Code       Description
423
================  ===============================
424
204 (No Content)  The request succeeded
425
================  ===============================
426

    
427

    
428
GET
429
"""
430

    
431
====================  ===========================
432
Request Header Name   Value
433
====================  ===========================
434
If-Modified-Since     Retrieve if container has changed since provided timestamp
435
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
436
====================  ===========================
437

    
438
|
439

    
440
======================  ===================================
441
Request Parameter Name  Value
442
======================  ===================================
443
limit                   The amount of results requested (default is 10000)
444
marker                  Return containers with name lexicographically after marker
445
prefix                  Return objects starting with prefix
446
delimiter               Return objects up to the delimiter (discussion follows)
447
path                    Assume ``prefix=path`` and ``delimiter=/``
448
format                  Optional extended reply type (can be ``json`` or ``xml``)
449
meta                    Return objects that satisfy the key queries in the specified comma separated list (use ``<key>``, ``!<key>`` for existence queries, ``<key><op><value>`` for value queries, where ``<op>`` can be one of ``=``, ``!=``, ``<=``, ``>=``, ``<``, ``>``)
450
shared                  Show only shared objects (no value parameter)
451
until                   Optional timestamp
452
======================  ===================================
453

    
454
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
455

    
456
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
457

    
458
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
459
Cross-user requests are not allowed to use ``until`` and include the following limited set of headers in the reply:
460

    
461
===========================  ===============================
462
Reply Header Name            Value
463
===========================  ===============================
464
X-Container-Block-Size       The block size used by the storage backend
465
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
466
X-Container-Object-Meta      A list with all meta keys used by allowed objects (**TBD**)
467
Last-Modified                The last container modification date
468
===========================  ===============================
469

    
470
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
471
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
472

    
473
==========================  ======================================
474
Name                        Description
475
==========================  ======================================
476
name                        The name of the object
477
hash                        The ETag of the object
478
bytes                       The size of the object
479
content_type                The MIME content type of the object
480
content_encoding            The encoding of the object (optional)
481
content-disposition         The presentation style of the object (optional)
482
last_modified               The last object modification date (regardless of version)
483
x_object_hash               The Merkle hash
484
x_object_uuid               The object's UUID
485
x_object_version            The object's version identifier
486
x_object_version_timestamp  The object's version timestamp
487
x_object_modified_by        The user that committed the object's version
488
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
489
x_object_sharing            Object permissions (optional)
490
x_object_shared_by          Object inheriting permissions (optional)
491
x_object_allowed_to         Allowed actions on object (optional)
492
x_object_public             Object's publicly accessible URI (optional)
493
x_object_meta_*             Optional user defined metadata
494
==========================  ======================================
495

    
496
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
497
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
498
In JSON results they appear as dictionaries with only a ``subdir`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
499
In case there is an object with the same name as a virtual directory marker, the object will be returned.
500

    
501
Example ``format=json`` reply:
502

    
503
::
504

    
505
  [{"name": "object",
506
    "bytes": 0,
507
    "hash": "d41d8cd98f00b204e9800998ecf8427e",
508
    "content_type": "application/octet-stream",
509
    "last_modified": "2011-12-02T08:10:41.565891+00:00",
510
    "x_object_meta": {"asdf": "qwerty"},
511
    "x_object_hash": "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855",
512
    "x_object_uuid": "8ed9af1b-c948-4bb6-82b0-48344f5c822c",
513
    "x_object_version": 98,
514
    "x_object_version_timestamp": "1322813441.565891",
515
    "x_object_modified_by": "user"}, ...]
516

    
517
Example ``format=xml`` reply:
518

    
519
::
520

    
521
  <?xml version="1.0" encoding="UTF-8"?>
522
  <container name="pithos">
523
    <object>
524
      <name>object</name>
525
      <bytes>0</bytes>
526
      <hash>d41d8cd98f00b204e9800998ecf8427e</hash>
527
      <content_type>application/octet-stream</content_type>
528
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
529
      <x_object_meta>
530
        <key>asdf</key><value>qwerty</value>
531
      </x_object_meta>
532
      <x_object_hash>e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855</x_object_hash>
533
      <x_object_uuid>8ed9af1b-c948-4bb6-82b0-48344f5c822c</x_object_uuid>
534
      <x_object_version>98</x_object_version>
535
      <x_object_version_timestamp>1322813441.565891</x_object_version_timestamp>
536
      <x_object_modified_by>chazapis</x_object_modified_by>
537
    </object>
538
    <object>...</object>
539
  </container>
540

    
541
For more examples of container details returned in JSON/XML formats refer to the OOS API documentation. In addition to the OOS API, Pithos returns all fields. Metadata values are grouped and returned as key-value pairs.
542

    
543
===========================  ===============================
544
Return Code                  Description
545
===========================  ===============================
546
200 (OK)                     The request succeeded
547
204 (No Content)             The account has no containers (only for non-extended replies)
548
304 (Not Modified)           The container has not been modified
549
412 (Precondition Failed)    The condition set can not be satisfied
550
===========================  ===============================
551

    
552
Will use a ``200`` return code if the reply is of type JSON/XML.
553

    
554

    
555
PUT
556
"""
557

    
558
====================  ================================
559
Request Header Name   Value
560
====================  ================================
561
X-Container-Policy-*  Container behavior and limits
562
X-Container-Meta-*    Optional user defined metadata
563
====================  ================================
564
 
565
No reply content/headers.
566

    
567
If no policy is defined, the container will be created with the default values.
568
Available policy directives:
569

    
570
* ``versioning``: Set to ``auto`` or ``none`` (default is ``auto``)
571
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
572

    
573
If the container already exists, the operation is equal to a ``POST`` with ``update`` defined.
574

    
575
================  ===============================
576
Return Code       Description
577
================  ===============================
578
201 (Created)     The container has been created
579
202 (Accepted)    The request has been accepted
580
================  ===============================
581

    
582

    
583
POST
584
""""
585

    
586
====================  ================================
587
Request Header Name   Value
588
====================  ================================
589
Content-Length        The size of the supplied data (optional, to upload)
590
Content-Type          The MIME content type of the supplied data (optional, to upload)
591
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
592
X-Container-Policy-*  Container behavior and limits
593
X-Container-Meta-*    Optional user defined metadata
594
====================  ================================
595

    
596
|
597

    
598
======================  ============================================
599
Request Parameter Name  Value
600
======================  ============================================
601
update                  Do not replace metadata/policy (no value parameter)
602
======================  ============================================
603

    
604
No reply content/headers, except when uploading data, where the reply consists of a list of hashes for the blocks received (in a simple text format, with one hash per line).
605

    
606
The operation will overwrite all user defined metadata, except if ``update`` is defined.
607
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
608

    
609
To upload blocks of data to the container, set ``Content-Type`` to ``application/octet-stream`` and ``Content-Length`` to a valid value (except if using ``chunked`` as the ``Transfer-Encoding``).
610

    
611
================  ===============================
612
Return Code       Description
613
================  ===============================
614
202 (Accepted)    The request has been accepted
615
================  ===============================
616

    
617

    
618
DELETE
619
""""""
620

    
621
======================  ===================================
622
Request Parameter Name  Value
623
======================  ===================================
624
until                   Optional timestamp
625
======================  ===================================
626

    
627
If ``until`` is defined, the container is "purged" up to that time (the history of all objects up to then is deleted).
628

    
629
No reply content/headers.
630

    
631
================  ===============================
632
Return Code       Description
633
================  ===============================
634
204 (No Content)  The request succeeded
635
409 (Conflict)    The container is not empty
636
================  ===============================
637

    
638

    
639
Object Level
640
^^^^^^^^^^^^
641

    
642
List of operations:
643

    
644
=========  =================================
645
Operation  Description
646
=========  =================================
647
HEAD       Retrieve object metadata
648
GET        Read object data
649
PUT        Write object data or copy/move object
650
COPY       Copy object
651
MOVE       Move object
652
POST       Update object metadata/data
653
DELETE     Delete object
654
=========  =================================
655

    
656

    
657
HEAD
658
""""
659

    
660
====================  ================================
661
Request Header Name   Value
662
====================  ================================
663
If-Match              Retrieve if ETags match
664
If-None-Match         Retrieve if ETags don't match
665
If-Modified-Since     Retrieve if object has changed since provided timestamp
666
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
667
====================  ================================
668

    
669
|
670

    
671
======================  ===================================
672
Request Parameter Name  Value
673
======================  ===================================
674
version                 Optional version identifier
675
======================  ===================================
676

    
677
|
678

    
679
==========================  ===============================
680
Reply Header Name           Value
681
==========================  ===============================
682
ETag                        The ETag of the object
683
Content-Length              The size of the object
684
Content-Type                The MIME content type of the object
685
Last-Modified               The last object modification date (regardless of version)
686
Content-Encoding            The encoding of the object (optional)
687
Content-Disposition         The presentation style of the object (optional)
688
X-Object-Hash               The Merkle hash
689
X-Object-UUID               The object's UUID
690
X-Object-Version            The object's version identifier
691
X-Object-Version-Timestamp  The object's version timestamp
692
X-Object-Modified-By        The user that comitted the object's version
693
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
694
X-Object-Sharing            Object permissions (optional)
695
X-Object-Shared-By          Object inheriting permissions (optional)
696
X-Object-Allowed-To         Allowed actions on object (optional)
697
X-Object-Public             Object's publicly accessible URI (optional)
698
X-Object-Meta-*             Optional user defined metadata
699
==========================  ===============================
700

    
701
|
702

    
703
================  ===============================
704
Return Code       Description
705
================  ===============================
706
200 (No Content)  The request succeeded
707
================  ===============================
708

    
709

    
710
GET
711
"""
712

    
713
====================  ================================
714
Request Header Name   Value
715
====================  ================================
716
Range                 Optional range of data to retrieve
717
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
718
If-Match              Retrieve if ETags match
719
If-None-Match         Retrieve if ETags don't match
720
If-Modified-Since     Retrieve if object has changed since provided timestamp
721
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
722
====================  ================================
723

    
724
|
725

    
726
======================  ===================================
727
Request Parameter Name  Value
728
======================  ===================================
729
format                  Optional extended reply type (can be ``json`` or ``xml``)
730
hashmap                 Optional request for hashmap (no value parameter)
731
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
732
======================  ===================================
733

    
734
The reply is the object's data (or part of it), except if a hashmap is requested with ``hashmap``, or a version list with ``version=list`` (in both cases an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
735

    
736
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block. The ``X-Object-Hash`` header reports the single Merkle hash of the object's hashmap (refer to http://bittorrent.org/beps/bep_0030.html for more information).
737

    
738
Example ``format=json`` reply:
739

    
740
::
741

    
742
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
743

    
744
Example ``format=xml`` reply:
745

    
746
::
747

    
748
  <?xml version="1.0" encoding="UTF-8"?>
749
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
750
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
751
    <hash>...</hash>
752
  </object>
753

    
754
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
755

    
756
Example ``format=json`` reply:
757

    
758
::
759

    
760
  {"versions": [[85, "1322734861.248469"], [86, "1322734905.009272"], ...]}
761

    
762
Example ``format=xml`` reply:
763

    
764
::
765

    
766
  <?xml version="1.0" encoding="UTF-8"?>
767
  <object name="file">
768
    <version timestamp="1322734861.248469">85</version>
769
    <version timestamp="1322734905.009272">86</version>
770
    <version timestamp="...">...</version>
771
  </object>
772

    
773
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
774

    
775
==========================  ===============================
776
Reply Header Name           Value
777
==========================  ===============================
778
ETag                        The ETag of the object
779
Content-Length              The size of the data returned
780
Content-Type                The MIME content type of the object
781
Content-Range               The range of data included (only on a single range request)
782
Last-Modified               The last object modification date (regardless of version)
783
Content-Encoding            The encoding of the object (optional)
784
Content-Disposition         The presentation style of the object (optional)
785
X-Object-Hash               The Merkle hash
786
X-Object-UUID               The object's UUID
787
X-Object-Version            The object's version identifier
788
X-Object-Version-Timestamp  The object's version timestamp
789
X-Object-Modified-By        The user that comitted the object's version
790
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
791
X-Object-Sharing            Object permissions (optional)
792
X-Object-Shared-By          Object inheriting permissions (optional)
793
X-Object-Allowed-To         Allowed actions on object (optional)
794
X-Object-Public             Object's publicly accessible URI (optional)
795
X-Object-Meta-*             Optional user defined metadata
796
==========================  ===============================
797

    
798
|
799

    
800
===========================  ==============================
801
Return Code                  Description
802
===========================  ==============================
803
200 (OK)                     The request succeeded
804
206 (Partial Content)        The range request succeeded
805
304 (Not Modified)           The object has not been modified
806
412 (Precondition Failed)    The condition set can not be satisfied
807
416 (Range Not Satisfiable)  The requested range is out of limits
808
===========================  ==============================
809

    
810

    
811
PUT
812
"""
813

    
814
====================  ================================
815
Request Header Name   Value
816
====================  ================================
817
If-Match              Put if ETags match with current object
818
If-None-Match         Put if ETags don't match with current object
819
ETag                  The MD5 hash of the object (optional to check written data)
820
Content-Length        The size of the data written
821
Content-Type          The MIME content type of the object
822
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
823
X-Copy-From           The source path in the form ``/<container>/<object>``
824
X-Move-From           The source path in the form ``/<container>/<object>``
825
X-Source-Account      The source account to copy/move from
826
X-Source-Version      The source version to copy from
827
Content-Encoding      The encoding of the object (optional)
828
Content-Disposition   The presentation style of the object (optional)
829
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
830
X-Object-Sharing      Object permissions (optional)
831
X-Object-Public       Object is publicly accessible (optional)
832
X-Object-Meta-*       Optional user defined metadata
833
====================  ================================
834

    
835
|
836

    
837
======================  ===================================
838
Request Parameter Name  Value
839
======================  ===================================
840
format                  Optional extended request type (can be ``json`` or ``xml``)
841
hashmap                 Optional hashmap provided instead of data (no value parameter)
842
======================  ===================================
843

    
844
The request is the object's data (or part of it), except if a hashmap is provided (using ``hashmap`` and ``format`` parameters). If using a hashmap and all different parts are stored in the server, the object is created, otherwise the server returns Conflict (409) with the list of the missing parts (in a simple text format, with one hash per line).
845

    
846
Hashmaps should be formatted as outlined in ``GET``.
847

    
848
==========================  ===============================
849
Reply Header Name           Value
850
==========================  ===============================
851
ETag                        The MD5 hash of the object
852
X-Object-Version            The object's new version
853
==========================  ===============================
854

    
855
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). Groups are specified as ``<account>:<group>``. To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
856

    
857
==============================  ==============================
858
Return Code                     Description
859
==============================  ==============================
860
201 (Created)                   The object has been created
861
409 (Conflict)                  The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a list of conflicting sharing paths will be included in the reply - in simple text format)
862
411 (Length Required)           Missing ``Content-Length`` or ``Content-Type`` in the request
863
413 (Request Entity Too Large)  Insufficient quota to complete the request
864
422 (Unprocessable Entity)      The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
865
==============================  ==============================
866

    
867

    
868
COPY
869
""""
870

    
871
====================  ================================
872
Request Header Name   Value
873
====================  ================================
874
If-Match              Proceed if ETags match with object
875
If-None-Match         Proceed if ETags don't match with object
876
Destination           The destination path in the form ``/<container>/<object>``
877
Destination-Account   The destination account to copy to
878
Content-Type          The MIME content type of the object (optional :sup:`*`)
879
Content-Encoding      The encoding of the object (optional)
880
Content-Disposition   The presentation style of the object (optional)
881
X-Source-Version      The source version to copy from
882
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
883
X-Object-Sharing      Object permissions (optional)
884
X-Object-Public       Object is publicly accessible (optional)
885
X-Object-Meta-*       Optional user defined metadata
886
====================  ================================
887

    
888
:sup:`*` *When using django locally with the supplied web server, do provide a valid Content-Type, as a type of text/plain is applied by default to all requests.*
889

    
890
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
891

    
892
==========================  ===============================
893
Reply Header Name           Value
894
==========================  ===============================
895
X-Object-Version            The object's new version
896
==========================  ===============================
897

    
898
|
899

    
900
==============================  ==============================
901
Return Code                     Description
902
==============================  ==============================
903
201 (Created)                   The object has been created
904
409 (Conflict)                  There are conflicting permissions (a list of conflicting sharing paths will be included in the reply - in simple text format)
905
413 (Request Entity Too Large)  Insufficient quota to complete the request
906
==============================  ==============================
907

    
908

    
909
MOVE
910
""""
911

    
912
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
913

    
914

    
915
POST
916
""""
917

    
918
====================  ================================
919
Request Header Name   Value
920
====================  ================================
921
If-Match              Proceed if ETags match with object
922
If-None-Match         Proceed if ETags don't match with object
923
Content-Length        The size of the data written (optional, to update)
924
Content-Type          The MIME content type of the object (optional, to update)
925
Content-Range         The range of data supplied (optional, to update)
926
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
927
Content-Encoding      The encoding of the object (optional)
928
Content-Disposition   The presentation style of the object (optional)
929
X-Source-Object       Update with data from the object at path ``/<container>/<object>`` (optional, to update)
930
X-Source-Account      The source account to update from
931
X-Source-Version      The source version to update from (optional, to update)
932
X-Object-Bytes        The updated object's final size (optional, when updating)
933
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
934
X-Object-Sharing      Object permissions (optional)
935
X-Object-Public       Object is publicly accessible (optional)
936
X-Object-Meta-*       Optional user defined metadata
937
====================  ================================
938

    
939
|
940

    
941
======================  ============================================
942
Request Parameter Name  Value
943
======================  ============================================
944
update                  Do not replace metadata (no value parameter)
945
======================  ============================================
946

    
947
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
948

    
949
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
950

    
951
To update an object's data:
952

    
953
* Either set ``Content-Type`` to ``application/octet-stream``, or provide an object with ``X-Source-Object``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
954
* If the data is supplied in the request (using ``Content-Type`` instead of ``X-Source-Object``), a valid ``Content-Length`` header is required - except if using chunked transfers (set ``Transfer-Encoding`` to ``chunked``).
955
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
956

    
957
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
958
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
959
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data will be appended to the object.
960

    
961
Optionally, truncate the updated object to the desired length with the ``X-Object-Bytes`` header.
962

    
963
A data update will trigger an ETag change. Updated ETags may happen asynchronously and appear at the server with a delay.
964

    
965
No reply content. No reply headers if only metadata is updated.
966

    
967
==========================  ===============================
968
Reply Header Name           Value
969
==========================  ===============================
970
ETag                        The new ETag of the object (data updated)
971
X-Object-Version            The object's new version
972
==========================  ===============================
973

    
974
|
975

    
976
==============================  ==============================
977
Return Code                     Description
978
==============================  ==============================
979
202 (Accepted)                  The request has been accepted (not a data update)
980
204 (No Content)                The request succeeded (data updated)
981
409 (Conflict)                  There are conflicting permissions (a list of conflicting sharing paths will be included in the reply - in simple text format)
982
411 (Length Required)           Missing ``Content-Length`` in the request
983
413 (Request Entity Too Large)  Insufficient quota to complete the request
984
416 (Range Not Satisfiable)     The supplied range is invalid
985
==============================  ==============================
986

    
987
The ``POST`` method can also be used for creating an object via a standard HTML form. If the request ``Content-Type`` is ``multipart/form-data``, none of the above headers will be processed. The form should have an ``X-Object-Data`` field, as in the following example. The token is passed as a request parameter. ::
988

    
989
  <form method="post" action="https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt?X-Auth-Token=0000" enctype="multipart/form-data">
990
    <input type="file" name="X-Object-Data">
991
    <input type="submit">
992
  </form>
993

    
994
This will create/override the object with the given name, as if using ``PUT``. The ``Content-Type`` of the object will be set to the value of the corresponding header sent in the part of the request containing the data (usually, automatically handled by the browser). Metadata, sharing and other object attributes can not be set this way.
995

    
996
==========================  ===============================
997
Reply Header Name           Value
998
==========================  ===============================
999
ETag                        The MD5 hash of the object
1000
X-Object-Version            The object's new version
1001
==========================  ===============================
1002

    
1003
|
1004

    
1005
==============================  ==============================
1006
Return Code                     Description
1007
==============================  ==============================
1008
201 (Created)                   The object has been created
1009
413 (Request Entity Too Large)  Insufficient quota to complete the request
1010
==============================  ==============================
1011

    
1012

    
1013
DELETE
1014
""""""
1015

    
1016
======================  ===================================
1017
Request Parameter Name  Value
1018
======================  ===================================
1019
until                   Optional timestamp
1020
======================  ===================================
1021

    
1022
If ``until`` is defined, the object is "purged" up to that time (the history up to then is deleted).
1023

    
1024
No reply content/headers.
1025

    
1026
===========================  ==============================
1027
Return Code                  Description
1028
===========================  ==============================
1029
204 (No Content)             The request succeeded
1030
===========================  ==============================
1031

    
1032
Sharing and Public Objects
1033
^^^^^^^^^^^^^^^^^^^^^^^^^^
1034

    
1035
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
1036

    
1037
A user may ``GET`` another account or container. The result will include a limited reply, containing only the allowed containers or objects respectively. A top-level request with an authentication token, will return a list of allowed accounts, so the user can easily find out which other users share objects. The ``X-Object-Allowed-To`` header lists the actions allowed on an object, if it does not belong to the requesting user.
1038

    
1039
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden):
1040

    
1041
==========================  ===============================
1042
Reply Header Name           Value
1043
==========================  ===============================
1044
ETag                        The ETag of the object
1045
Content-Length              The size of the data returned
1046
Content-Type                The MIME content type of the object
1047
Content-Range               The range of data included (only on a single range request)
1048
Last-Modified               The last object modification date (regardless of version)
1049
Content-Encoding            The encoding of the object (optional)
1050
Content-Disposition         The presentation style of the object (optional)
1051
==========================  ===============================
1052

    
1053
Public objects are not included and do not influence cross-user listings. They are, however, readable by all users.
1054

    
1055
Summary
1056
^^^^^^^
1057

    
1058
List of differences from the OOS API:
1059

    
1060
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
1061
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
1062
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings. (**TBD**)
1063
* Account and container policies to manage behavior and limits. Container behavior overrides account settings. Account quota sets the maximum bytes limit, regardless of container values.
1064
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
1065
* All metadata replies, at all levels, include latest modification information.
1066
* At all levels, a ``HEAD`` or ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
1067
* Container/object lists include all associated metadata if the reply is of type JSON/XML. Some names are kept to their OOS API equivalents for compatibility.
1068
* Option to include only shared containers/objects in listings.
1069
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
1070
* Multi-range object ``GET`` support as outlined in RFC2616.
1071
* Object hashmap retrieval through ``GET`` and the ``format`` parameter.
1072
* Object create via hashmap through ``PUT`` and the ``format`` parameter.
1073
* The object's Merkle hash is always returned in the ``X-Object-Hash`` header.
1074
* The object's UUID is always returned in the ``X-Object-UUID`` header. The UUID remains unchanged, even when the object's data or metadata changes, or the object is moved to another path (is renamed). A new UUID is assigned when creating or copying an object.
1075
* Object create using ``POST`` to support standard HTML forms.
1076
* Partial object updates through ``POST``, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers. Use another object's data to update with ``X-Source-Object`` and ``X-Source-Version``. Truncate with ``X-Object-Bytes``.
1077
* Include new version identifier in replies for object replace/change requests.
1078
* Object ``MOVE`` support.
1079
* Conditional object create/update operations, using ``If-Match`` and ``If-None-Match`` headers.
1080
* Time-variant account/container listings via the ``until`` parameter.
1081
* Object versions - parameter ``version`` in ``HEAD``/``GET`` (list versions with ``GET``), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in ``PUT``/``COPY``.
1082
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Cross-user operations are allowed - controlled by sharing directives. Available actions in cross-user requests are reported with ``X-Object-Allowed-To``. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
1083
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
1084
* Copy and move between accounts with ``X-Source-Account`` and ``Destination-Account`` headers.
1085
* Large object support with ``X-Object-Manifest``.
1086
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
1087
* Purge container/object history with the ``until`` parameter in ``DELETE``.
1088

    
1089
Clarifications/suggestions:
1090

    
1091
* All non-ASCII characters in headers should be URL-encoded.
1092
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
1093
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
1094
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
1095
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
1096
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired request/reply format. The parameter overrides the header.
1097
* Container/object lists use a ``200`` return code if the reply is of type JSON/XML. The reply will include an empty JSON/XML.
1098
* In headers, dates are formatted according to RFC 1123. In extended information listings, the ``last_modified`` field is formatted according to ISO 8601 (for OOS API compatibility). All other fields (Pithos extensions) use integer tiemstamps.
1099
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
1100
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
1101
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
1102

    
1103
The Pithos Client
1104
-----------------
1105

    
1106
User Experience
1107
^^^^^^^^^^^^^^^
1108

    
1109
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
1110

    
1111
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
1112

    
1113
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
1114
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
1115
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
1116
* The ``others`` element, which contains all objects that other users share with the user.
1117
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
1118
* The ``history`` element, which allows browsing past instances of ``home`` and - optionally - ``trash``.
1119

    
1120
Objects in Pithos can be:
1121

    
1122
* Moved to trash and then deleted.
1123
* Shared with specific permissions.
1124
* Made public (shared with non-Pithos users).
1125
* Restored from previous versions.
1126

    
1127
Some of these functions are performed by the client software and some by the Pithos server.
1128

    
1129
In the first version of Pithos, objects could also be assigned custom tags. This is no longer supported. Existing deployments can migrate tags into a specific metadata value, i.e. ``X-Object-Meta-Tags``.
1130

    
1131
Implementation Guidelines
1132
^^^^^^^^^^^^^^^^^^^^^^^^^
1133

    
1134
Pithos clients should use the ``pithos`` and ``trash`` containers for active and inactive objects respectively. If any of these containers is not found, the client software should create it, without interrupting the user's workflow. The ``home`` element corresponds to ``pithos`` and the ``trash`` element to ``trash``. Use ``PUT`` with the ``X-Move-From`` header, or ``MOVE`` to transfer objects from one container to the other. Use ``DELETE`` to remove from ``pithos`` without trashing, or to remove from ``trash``. When moving objects, detect naming conflicts with the ``If-Match`` or ``If-None-Match`` headers. Such conflicts should be resolved by the user.
1135

    
1136
Object names should use the ``/`` delimiter to impose a hierarchy of folders and files.
1137

    
1138
The ``shared`` element should be implemented as a read-only view of the ``pithos`` container, using the ``shared`` parameter when listing objects. The ``others`` element, should start with a top-level ``GET`` to retrieve the list of accounts accessible to the user. It is suggested that the client software hides the next step of navigation - the container - if it only includes ``pithos`` and forwards the user directly to the objects.
1139

    
1140
Public objects are not included in ``shared`` and ``others`` listings. It is suggested that they are marked in a visually distinctive way in ``pithos`` listings (for example using an icon overlay).
1141

    
1142
A special application menu, or a section in application preferences, should be devoted to managing groups (the ``groups`` element). All group-related actions are implemented at the account level.
1143

    
1144
Browsing past versions of objects should be available both at the object and the container level. At the object level, a list of past versions can be included in the screen showing details or more information on the object (metadata, permissions, etc.). At the container level, it is suggested that clients use a ``history`` element, which presents to the user a read-only, time-variable view of ``pithos`` contents. This can be accomplished via the ``until`` parameter in listings. Optionally, ``history`` may include ``trash``.
1145

    
1146
Uploading and downloading data
1147
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1148

    
1149
By using hashmaps to upload and download objects the corresponding operations can complete much faster.
1150

    
1151
In the case of an upload, only the missing blocks will be submitted to the server:
1152

    
1153
* Calculate the hash value for each block of the object to be uploaded. Use the hash algorithm and block size of the destination container.
1154
* Send a hashmap ``PUT`` request for the object.
1155

    
1156
  * Server responds with status ``201`` (Created):
1157

    
1158
    * Blocks are already on the server. The object has been created. Done.
1159

    
1160
  * Server responds with status ``409`` (Conflict):
1161

    
1162
    * Server's response body contains the hashes of the blocks that do not exist on the server.
1163
    * For each hash value in the server's response (or all hashes together):
1164

    
1165
      * Send a ``POST`` request to the destination container with the corresponding data.
1166

    
1167
* Repeat hashmap ``PUT``. Fail if the server's response is not ``201``.
1168

    
1169
Consulting hashmaps when downloading allows for resuming partially transferred objects. The client should retrieve the hashmap from the server and compare it with the hashmap computed from the respective local file. Any missing parts can be downloaded with ``GET`` requests with the additional ``Range`` header.
1170

    
1171
Syncing
1172
^^^^^^^
1173

    
1174
Consider the following algorithm for synchronizing a local folder with the server. The "state" is the complete object listing, with the corresponding attributes.
1175
 
1176
::
1177

    
1178
  L: local state (stored state from last sync with the server)
1179
  C: current state (state computed right before sync)
1180
  S: server state
1181

    
1182
  if C == L:
1183
      # No local changes
1184
      if S == L:
1185
          # No remote changes, nothing to do
1186
      else:
1187
          # Update local state to match that of the server
1188
         L = S
1189
  else:
1190
      # We have local changes
1191
      if S == L:
1192
          # No remote changes, update the server
1193
          S = C
1194
          L = S
1195
      else:
1196
          # Both we and server have changes
1197
          if C == S:
1198
              # We were lucky, we did the same change
1199
              L = S
1200
          else:
1201
              # We have conflicting changes
1202
              resolve conflict
1203

    
1204
Notes:
1205

    
1206
* States represent file hashes (it is suggested to use Merkle). Deleted or non-existing files are assumed to have a magic hash (e.g. empty string).
1207
* Updating a state (either local or remote) implies downloading, uploading or deleting the appropriate file.
1208

    
1209
Recommended Practices and Examples
1210
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1211

    
1212
Assuming an authentication token is obtained, the following high-level operations are available - shown with ``curl``:
1213

    
1214
* Get account information ::
1215

    
1216
    curl -X HEAD -D - \
1217
         -H "X-Auth-Token: 0000" \
1218
         https://pithos.dev.grnet.gr/v1/user
1219

    
1220
* List available containers ::
1221

    
1222
    curl -X GET -D - \
1223
         -H "X-Auth-Token: 0000" \
1224
         https://pithos.dev.grnet.gr/v1/user
1225

    
1226
* Get container information ::
1227

    
1228
    curl -X HEAD -D - \
1229
         -H "X-Auth-Token: 0000" \
1230
         https://pithos.dev.grnet.gr/v1/user/pithos
1231

    
1232
* Add a new container ::
1233

    
1234
    curl -X PUT -D - \
1235
         -H "X-Auth-Token: 0000" \
1236
         https://pithos.dev.grnet.gr/v1/user/test
1237

    
1238
* Delete a container ::
1239

    
1240
    curl -X DELETE -D - \
1241
         -H "X-Auth-Token: 0000" \
1242
         https://pithos.dev.grnet.gr/v1/user/test
1243

    
1244
* List objects in a container ::
1245

    
1246
    curl -X GET -D - \
1247
         -H "X-Auth-Token: 0000" \
1248
         https://pithos.dev.grnet.gr/v1/user/pithos
1249

    
1250
* List objects in a container (extended reply) ::
1251

    
1252
    curl -X GET -D - \
1253
         -H "X-Auth-Token: 0000" \
1254
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
1255

    
1256
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
1257

    
1258
* List metadata keys used by objects in a container
1259

    
1260
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``). (**TBD**)
1261

    
1262
* List objects in a container having a specific meta defined ::
1263

    
1264
    curl -X GET -D - \
1265
         -H "X-Auth-Token: 0000" \
1266
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=favorites
1267

    
1268
* Retrieve an object ::
1269

    
1270
    curl -X GET -D - \
1271
         -H "X-Auth-Token: 0000" \
1272
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
1273

    
1274
* Retrieve an object (specific ranges of data) ::
1275

    
1276
    curl -X GET -D - \
1277
         -H "X-Auth-Token: 0000" \
1278
         -H "Range: bytes=0-9" \
1279
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
1280

    
1281
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
1282

    
1283
* Add a new object (folder type) (**TBD**) ::
1284

    
1285
    curl -X PUT -D - \
1286
         -H "X-Auth-Token: 0000" \
1287
         -H "Content-Type: application/directory" \
1288
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
1289

    
1290
* Add a new object ::
1291

    
1292
    curl -X PUT -D - \
1293
         -H "X-Auth-Token: 0000" \
1294
         -H "Content-Type: text/plain" \
1295
         -T EXAMPLE.txt
1296
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
1297

    
1298
* Update an object ::
1299

    
1300
    curl -X POST -D - \
1301
         -H "X-Auth-Token: 0000" \
1302
         -H "Content-Length: 10" \
1303
         -H "Content-Type: application/octet-stream" \
1304
         -H "Content-Range: bytes 10-19/*" \
1305
         -d "0123456789" \
1306
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1307

    
1308
  This will update bytes 10-19 with the data specified.
1309

    
1310
* Update an object (append) ::
1311

    
1312
    curl -X POST -D - \
1313
         -H "X-Auth-Token: 0000" \
1314
         -H "Content-Length: 10" \
1315
         -H "Content-Type: application/octet-stream" \
1316
         -H "Content-Range: bytes */*" \
1317
         -d "0123456789" \
1318
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1319

    
1320
* Update an object (truncate) ::
1321

    
1322
    curl -X POST -D - \
1323
         -H "X-Auth-Token: 0000" \
1324
         -H "X-Source-Object: /folder/EXAMPLE.txt" \
1325
         -H "Content-Range: bytes 0-0/*" \
1326
         -H "X-Object-Bytes: 0" \
1327
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1328

    
1329
  This will truncate the object to 0 bytes.
1330

    
1331
* Add object metadata ::
1332

    
1333
    curl -X POST -D - \
1334
         -H "X-Auth-Token: 0000" \
1335
         -H "X-Object-Meta-First: first_meta_value" \
1336
         -H "X-Object-Meta-Second: second_meta_value" \
1337
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1338

    
1339
* Delete object metadata ::
1340

    
1341
    curl -X POST -D - \
1342
         -H "X-Auth-Token: 0000" \
1343
         -H "X-Object-Meta-First: first_meta_value" \
1344
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1345

    
1346
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
1347

    
1348
* Delete an object ::
1349

    
1350
    curl -X DELETE -D - \
1351
         -H "X-Auth-Token: 0000" \
1352
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt