Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ 53cff70c

History | View | Annotate | Download (69.9 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
More info about Pithos can be found here: https://code.grnet.gr/projects/pithos
23

    
24
Document Revisions
25
^^^^^^^^^^^^^^^^^^
26

    
27
=========================  ================================
28
Revision                   Description
29
=========================  ================================
30
0.8 (Jan 24, 2012)         Update allowed versioning values.
31
\                          Change policy/meta formatting in JSON/XML replies.
32
\                          Document that all non-ASCII characters in headers should be URL-encoded.
33
\                          Support metadata-based queries when listing objects at the container level.
34
\                          Note Content-Type issue when using the internal django web server.
35
\                          Add object UUID field.
36
\                          Always reply with the MD5 in the ETag.
37
\                          Note that ``/login`` will only work if an external authentication system is defined.
38
\                          Include option to ignore Content-Type on ``COPY``/``MOVE``.
39
0.7 (Nov 21, 2011)         Suggest upload/download methods using hashmaps.
40
\                          Propose syncing algorithm.
41
\                          Support cross-account object copy and move.
42
\                          Pass token as a request parameter when using ``POST`` via an HTML form.
43
\                          Optionally use source account to update object from another object.
44
\                          Use container ``POST`` to upload missing blocks of data.
45
\                          Report policy in account headers.
46
\                          Add insufficient quota reply.
47
\                          Use special meta to always report Merkle hash.
48
0.6 (Sept 13, 2011)        Reply with Merkle hash as the ETag when updating objects.
49
\                          Include version id in object replace/change replies.
50
\                          Change conflict (409) replies format to text.
51
\                          Tags should be migrated to a meta value.
52
\                          Container ``PUT`` updates metadata/policy.
53
\                          Report allowed actions in shared object replies.
54
\                          Provide ``https://hostname/login`` for Shibboleth authentication.
55
\                          Use ``hashmap`` parameter in object ``GET``/``PUT`` to use hashmaps.
56
0.5 (July 22, 2011)        Object update from another object's data.
57
\                          Support object truncate.
58
\                          Create object using a standard HTML form.
59
\                          Purge container/object history.
60
\                          List other accounts that share objects with a user.
61
\                          List shared containers/objects.
62
\                          Update implementation guidelines.
63
\                          Check preconditions when creating/updating objects.
64
0.4 (July 01, 2011)        Object permissions and account groups.
65
\                          Control versioning behavior and container quotas with container policy directives.
66
\                          Support updating/deleting individual metadata with ``POST``.
67
\                          Create object using hashmap.
68
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
69
\                          Allow for publicly available objects via ``https://hostname/public``.
70
\                          Support time-variant account/container listings. 
71
\                          Add source version when duplicating with ``PUT``/``COPY``.
72
\                          Request version in object ``HEAD``/``GET`` requests (list versions with ``GET``).
73
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
74
\                          Include underlying storage characteristics in container meta.
75
\                          Support for partial object updates through ``POST``.
76
\                          Expose object hashmaps through ``GET``.
77
\                          Support for multi-range object ``GET`` requests.
78
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
79
=========================  ================================
80

    
81
Pithos Users and Authentication
82
-------------------------------
83

    
84
In Pithos, each user is uniquely identified by a token. All API requests require a token and each token is internally resolved to an account string. The API uses the account string to identify the user's own files, thus whether a request is local or cross-account.
85

    
86
Pithos does not keep a user database. For development and testing purposes, user identifiers and their corresponding tokens can be defined in the settings file. However, Pithos is designed with an external authentication service in mind. This service must handle the details of validating user credentials and communicate with Pithos via a middleware software component that, given a token, fills in the internal request account variable.
87

    
88
Client software using Pithos, if not already knowing a user's identifier and token, should forward to the ``/login`` URI. The Pithos server, depending on its configuration will redirect to the appropriate login page.
89

    
90
The login URI accepts the following parameters:
91

    
92
======================  =========================
93
Request Parameter Name  Value
94
======================  =========================
95
next                    The URI to redirect to when the process is finished
96
renew                   Force token renewal (no value parameter)
97
======================  =========================
98

    
99
When done with logging in, the service's login URI should redirect to the URI provided with ``next``, adding ``user`` and ``token`` parameters, which contain the account and token fields respectively.
100

    
101
A user management service that implements a login URI according to these conventions is Astakos (https://code.grnet.gr/projects/astakos), by GRNET.
102

    
103
The Pithos API
104
--------------
105

    
106
The URI requests supported by the Pithos API follow one of the following forms:
107

    
108
* Top level: ``https://hostname/v1/``
109
* Account level: ``https://hostname/v1/<account>``
110
* Container level: ``https://hostname/v1/<account>/<container>``
111
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
112

    
113
All requests must include an ``X-Auth-Token`` - as a header, or a parameter.
114

    
115
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
116

    
117
=========================  ================================
118
Return Code                Description
119
=========================  ================================
120
400 (Bad Request)          The request is invalid
121
401 (Unauthorized)         Missing or invalid token
122
403 (Forbidden)            Request not allowed
123
404 (Not Found)            The requested resource was not found
124
503 (Service Unavailable)  The request cannot be completed because of an internal error
125
=========================  ================================
126

    
127
Top Level
128
^^^^^^^^^
129

    
130
List of operations:
131

    
132
=========  ==================
133
Operation  Description
134
=========  ==================
135
GET        Authentication (for compatibility with the OOS API) or list allowed accounts
136
=========  ==================
137

    
138
GET
139
"""
140

    
141
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
142

    
143
================  =====================
144
Return Code       Description
145
================  =====================
146
204 (No Content)  The request succeeded
147
================  =====================
148

    
149
If an ``X-Auth-Token`` is already present, the operation will be interpreted as a request to list other accounts that share objects to the user.
150

    
151
======================  =========================
152
Request Parameter Name  Value
153
======================  =========================
154
limit                   The amount of results requested (default is 10000)
155
marker                  Return containers with name lexicographically after marker
156
format                  Optional extended reply type (can be ``json`` or ``xml``)
157
======================  =========================
158

    
159
The reply is a list of account names.
160
If a ``format=xml`` or ``format=json`` argument is given, extended information on the accounts will be returned, serialized in the chosen format.
161
For each account, the information will include the following (names will be in lower case and with hyphens replaced with underscores):
162

    
163
===========================  ============================
164
Name                         Description
165
===========================  ============================
166
name                         The name of the account
167
last_modified                The last account modification date (regardless of ``until``)
168
===========================  ============================
169

    
170
Example ``format=json`` reply:
171

    
172
::
173

    
174
  [{"name": "user", "last_modified": "2011-12-02T08:10:41.565891+00:00"}, ...]
175

    
176
Example ``format=xml`` reply:
177

    
178
::
179

    
180
  <?xml version="1.0" encoding="UTF-8"?>
181
  <accounts>
182
    <account>
183
      <name>user</name>
184
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
185
    </account>
186
    <account>...</account>
187
  </accounts>
188

    
189
===========================  =====================
190
Return Code                  Description
191
===========================  =====================
192
200 (OK)                     The request succeeded
193
204 (No Content)             The user has no access to other accounts (only for non-extended replies)
194
===========================  =====================
195

    
196
Will use a ``200`` return code if the reply is of type JSON/XML.
197

    
198
Account Level
199
^^^^^^^^^^^^^
200

    
201
List of operations:
202

    
203
=========  ==================
204
Operation  Description
205
=========  ==================
206
HEAD       Retrieve account metadata
207
GET        List containers
208
POST       Update account metadata
209
=========  ==================
210

    
211
HEAD
212
""""
213

    
214
====================  ===========================
215
Request Header Name   Value
216
====================  ===========================
217
If-Modified-Since     Retrieve if account has changed since provided timestamp
218
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
219
====================  ===========================
220

    
221
|
222

    
223
======================  ===================================
224
Request Parameter Name  Value
225
======================  ===================================
226
until                   Optional timestamp
227
======================  ===================================
228

    
229
Cross-user requests are not allowed to use ``until`` and only include the account modification date in the reply.
230

    
231
==========================  =====================
232
Reply Header Name           Value
233
==========================  =====================
234
X-Account-Container-Count   The total number of containers
235
X-Account-Bytes-Used        The total number of bytes stored
236
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
237
X-Account-Group-*           Optional user defined groups
238
X-Account-Policy-*          Account behavior and limits
239
X-Account-Meta-*            Optional user defined metadata
240
Last-Modified               The last account modification date (regardless of ``until``)
241
==========================  =====================
242

    
243
|
244

    
245
================  =====================
246
Return Code       Description
247
================  =====================
248
204 (No Content)  The request succeeded
249
================  =====================
250

    
251

    
252
GET
253
"""
254

    
255
====================  ===========================
256
Request Header Name   Value
257
====================  ===========================
258
If-Modified-Since     Retrieve if account has changed since provided timestamp
259
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
260
====================  ===========================
261

    
262
|
263

    
264
======================  =========================
265
Request Parameter Name  Value
266
======================  =========================
267
limit                   The amount of results requested (default is 10000)
268
marker                  Return containers with name lexicographically after marker
269
format                  Optional extended reply type (can be ``json`` or ``xml``)
270
shared                  Show only shared containers (no value parameter)
271
until                   Optional timestamp
272
======================  =========================
273

    
274
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
275
Cross-user requests are not allowed to use ``until`` and only include the account/container modification dates in the reply.
276

    
277
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
278
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
279

    
280
===========================  ============================
281
Name                         Description
282
===========================  ============================
283
name                         The name of the container
284
count                        The number of objects inside the container
285
bytes                        The total size of the objects inside the container
286
last_modified                The last container modification date (regardless of ``until``)
287
x_container_until_timestamp  The last container modification date until the timestamp provided
288
x_container_policy_*         Container behavior and limits
289
x_container_meta_*           Optional user defined metadata
290
===========================  ============================
291

    
292
Example ``format=json`` reply:
293

    
294
::
295

    
296
  [{"name": "pithos",
297
    "bytes": 62452,
298
    "count": 8374,
299
    "last_modified": "2011-12-02T08:10:41.565891+00:00",
300
    "x_container_policy": {"quota": "53687091200", "versioning": "auto"},
301
    "x_container_meta": {"a": "b", "1": "2"}}, ...]
302

    
303
Example ``format=xml`` reply:
304

    
305
::
306

    
307
  <?xml version="1.0" encoding="UTF-8"?>
308
  <account name="user">
309
    <container>
310
      <name>pithos</name>
311
      <bytes>62452</bytes>
312
      <count>8374</count>
313
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
314
      <x_container_policy>
315
        <key>quota</key><value>53687091200</value>
316
        <key>versioning</key><value>auto</value>
317
      </x_container_policy>
318
      <x_container_meta>
319
        <key>a</key><value>b</value>
320
        <key>1</key><value>2</value>
321
      </x_container_meta>
322
    </container>
323
    <container>...</container>
324
  </account>
325

    
326
For more examples of container details returned in JSON/XML formats refer to the OOS API documentation. In addition to the OOS API, Pithos returns all fields. Policy and metadata values are grouped and returned as key-value pairs.
327

    
328
===========================  =====================
329
Return Code                  Description
330
===========================  =====================
331
200 (OK)                     The request succeeded
332
204 (No Content)             The account has no containers (only for non-extended replies)
333
304 (Not Modified)           The account has not been modified
334
412 (Precondition Failed)    The condition set can not be satisfied
335
===========================  =====================
336

    
337
Will use a ``200`` return code if the reply is of type JSON/XML.
338

    
339

    
340
POST
341
""""
342

    
343
====================  ===========================
344
Request Header Name   Value
345
====================  ===========================
346
X-Account-Group-*     Optional user defined groups
347
X-Account-Meta-*      Optional user defined metadata
348
====================  ===========================
349

    
350
|
351

    
352
======================  ============================================
353
Request Parameter Name  Value
354
======================  ============================================
355
update                  Do not replace metadata/groups (no value parameter)
356
======================  ============================================
357

    
358
No reply content/headers.
359

    
360
The operation will overwrite all user defined metadata, except if ``update`` is defined.
361
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
362

    
363
================  ===============================
364
Return Code       Description
365
================  ===============================
366
202 (Accepted)    The request has been accepted
367
================  ===============================
368

    
369

    
370
Container Level
371
^^^^^^^^^^^^^^^
372

    
373
List of operations:
374

    
375
=========  ============================
376
Operation  Description
377
=========  ============================
378
HEAD       Retrieve container metadata
379
GET        List objects
380
PUT        Create/update container
381
POST       Update container metadata
382
DELETE     Delete container
383
=========  ============================
384

    
385

    
386
HEAD
387
""""
388

    
389
====================  ===========================
390
Request Header Name   Value
391
====================  ===========================
392
If-Modified-Since     Retrieve if container has changed since provided timestamp
393
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
394
====================  ===========================
395

    
396
|
397

    
398
======================  ===================================
399
Request Parameter Name  Value
400
======================  ===================================
401
until                   Optional timestamp
402
======================  ===================================
403

    
404
Cross-user requests are not allowed to use ``until`` and only include the container modification date in the reply.
405

    
406
===========================  ===============================
407
Reply Header Name            Value
408
===========================  ===============================
409
X-Container-Object-Count     The total number of objects in the container
410
X-Container-Bytes-Used       The total number of bytes of all objects stored
411
X-Container-Block-Size       The block size used by the storage backend
412
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
413
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
414
X-Container-Object-Meta      A list with all meta keys used by objects (**TBD**)
415
X-Container-Policy-*         Container behavior and limits
416
X-Container-Meta-*           Optional user defined metadata
417
Last-Modified                The last container modification date (regardless of ``until``)
418
===========================  ===============================
419

    
420
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix, formatted as a comma-separated list. See container ``PUT`` for a reference of policy directives. (**TBD**)
421

    
422
================  ===============================
423
Return Code       Description
424
================  ===============================
425
204 (No Content)  The request succeeded
426
================  ===============================
427

    
428

    
429
GET
430
"""
431

    
432
====================  ===========================
433
Request Header Name   Value
434
====================  ===========================
435
If-Modified-Since     Retrieve if container has changed since provided timestamp
436
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
437
====================  ===========================
438

    
439
|
440

    
441
======================  ===================================
442
Request Parameter Name  Value
443
======================  ===================================
444
limit                   The amount of results requested (default is 10000)
445
marker                  Return containers with name lexicographically after marker
446
prefix                  Return objects starting with prefix
447
delimiter               Return objects up to the delimiter (discussion follows)
448
path                    Assume ``prefix=path`` and ``delimiter=/``
449
format                  Optional extended reply type (can be ``json`` or ``xml``)
450
meta                    Return objects that satisfy the key queries in the specified comma separated list (use ``<key>``, ``!<key>`` for existence queries, ``<key><op><value>`` for value queries, where ``<op>`` can be one of ``=``, ``!=``, ``<=``, ``>=``, ``<``, ``>``)
451
shared                  Show only shared objects (no value parameter)
452
until                   Optional timestamp
453
======================  ===================================
454

    
455
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
456

    
457
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
458

    
459
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
460
Cross-user requests are not allowed to use ``until`` and include the following limited set of headers in the reply:
461

    
462
===========================  ===============================
463
Reply Header Name            Value
464
===========================  ===============================
465
X-Container-Block-Size       The block size used by the storage backend
466
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
467
X-Container-Object-Meta      A list with all meta keys used by allowed objects (**TBD**)
468
Last-Modified                The last container modification date
469
===========================  ===============================
470

    
471
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
472
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
473

    
474
==========================  ======================================
475
Name                        Description
476
==========================  ======================================
477
name                        The name of the object
478
hash                        The ETag of the object
479
bytes                       The size of the object
480
content_type                The MIME content type of the object
481
content_encoding            The encoding of the object (optional)
482
content-disposition         The presentation style of the object (optional)
483
last_modified               The last object modification date (regardless of version)
484
x_object_hash               The Merkle hash
485
x_object_uuid               The object's UUID
486
x_object_version            The object's version identifier
487
x_object_version_timestamp  The object's version timestamp
488
x_object_modified_by        The user that committed the object's version
489
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
490
x_object_sharing            Object permissions (optional)
491
x_object_shared_by          Object inheriting permissions (optional)
492
x_object_allowed_to         Allowed actions on object (optional)
493
x_object_public             Object's publicly accessible URI (optional)
494
x_object_meta_*             Optional user defined metadata
495
==========================  ======================================
496

    
497
Sharing metadata will only be returned if there is no ``until`` parameter defined.
498

    
499
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
500
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
501
In JSON results they appear as dictionaries with only a ``subdir`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
502
In case there is an object with the same name as a virtual directory marker, the object will be returned.
503

    
504
Example ``format=json`` reply:
505

    
506
::
507

    
508
  [{"name": "object",
509
    "bytes": 0,
510
    "hash": "d41d8cd98f00b204e9800998ecf8427e",
511
    "content_type": "application/octet-stream",
512
    "last_modified": "2011-12-02T08:10:41.565891+00:00",
513
    "x_object_meta": {"asdf": "qwerty"},
514
    "x_object_hash": "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855",
515
    "x_object_uuid": "8ed9af1b-c948-4bb6-82b0-48344f5c822c",
516
    "x_object_version": 98,
517
    "x_object_version_timestamp": "1322813441.565891",
518
    "x_object_modified_by": "user"}, ...]
519

    
520
Example ``format=xml`` reply:
521

    
522
::
523

    
524
  <?xml version="1.0" encoding="UTF-8"?>
525
  <container name="pithos">
526
    <object>
527
      <name>object</name>
528
      <bytes>0</bytes>
529
      <hash>d41d8cd98f00b204e9800998ecf8427e</hash>
530
      <content_type>application/octet-stream</content_type>
531
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
532
      <x_object_meta>
533
        <key>asdf</key><value>qwerty</value>
534
      </x_object_meta>
535
      <x_object_hash>e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855</x_object_hash>
536
      <x_object_uuid>8ed9af1b-c948-4bb6-82b0-48344f5c822c</x_object_uuid>
537
      <x_object_version>98</x_object_version>
538
      <x_object_version_timestamp>1322813441.565891</x_object_version_timestamp>
539
      <x_object_modified_by>chazapis</x_object_modified_by>
540
    </object>
541
    <object>...</object>
542
  </container>
543

    
544
For more examples of container details returned in JSON/XML formats refer to the OOS API documentation. In addition to the OOS API, Pithos returns all fields. Metadata values are grouped and returned as key-value pairs.
545

    
546
===========================  ===============================
547
Return Code                  Description
548
===========================  ===============================
549
200 (OK)                     The request succeeded
550
204 (No Content)             The account has no containers (only for non-extended replies)
551
304 (Not Modified)           The container has not been modified
552
412 (Precondition Failed)    The condition set can not be satisfied
553
===========================  ===============================
554

    
555
Will use a ``200`` return code if the reply is of type JSON/XML.
556

    
557

    
558
PUT
559
"""
560

    
561
====================  ================================
562
Request Header Name   Value
563
====================  ================================
564
X-Container-Policy-*  Container behavior and limits
565
X-Container-Meta-*    Optional user defined metadata
566
====================  ================================
567
 
568
No reply content/headers.
569

    
570
If no policy is defined, the container will be created with the default values.
571
Available policy directives:
572

    
573
* ``versioning``: Set to ``auto`` or ``none`` (default is ``auto``)
574
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
575

    
576
If the container already exists, the operation is equal to a ``POST`` with ``update`` defined.
577

    
578
================  ===============================
579
Return Code       Description
580
================  ===============================
581
201 (Created)     The container has been created
582
202 (Accepted)    The request has been accepted
583
================  ===============================
584

    
585

    
586
POST
587
""""
588

    
589
====================  ================================
590
Request Header Name   Value
591
====================  ================================
592
Content-Length        The size of the supplied data (optional, to upload)
593
Content-Type          The MIME content type of the supplied data (optional, to upload)
594
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
595
X-Container-Policy-*  Container behavior and limits
596
X-Container-Meta-*    Optional user defined metadata
597
====================  ================================
598

    
599
|
600

    
601
======================  ============================================
602
Request Parameter Name  Value
603
======================  ============================================
604
update                  Do not replace metadata/policy (no value parameter)
605
======================  ============================================
606

    
607
No reply content/headers, except when uploading data, where the reply consists of a list of hashes for the blocks received (in a simple text format, with one hash per line).
608

    
609
The operation will overwrite all user defined metadata, except if ``update`` is defined.
610
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
611

    
612
To upload blocks of data to the container, set ``Content-Type`` to ``application/octet-stream`` and ``Content-Length`` to a valid value (except if using ``chunked`` as the ``Transfer-Encoding``).
613

    
614
================  ===============================
615
Return Code       Description
616
================  ===============================
617
202 (Accepted)    The request has been accepted
618
================  ===============================
619

    
620

    
621
DELETE
622
""""""
623

    
624
======================  ===================================
625
Request Parameter Name  Value
626
======================  ===================================
627
until                   Optional timestamp
628
======================  ===================================
629

    
630
If ``until`` is defined, the container is "purged" up to that time (the history of all objects up to then is deleted).
631

    
632
No reply content/headers.
633

    
634
================  ===============================
635
Return Code       Description
636
================  ===============================
637
204 (No Content)  The request succeeded
638
409 (Conflict)    The container is not empty
639
================  ===============================
640

    
641

    
642
Object Level
643
^^^^^^^^^^^^
644

    
645
List of operations:
646

    
647
=========  =================================
648
Operation  Description
649
=========  =================================
650
HEAD       Retrieve object metadata
651
GET        Read object data
652
PUT        Write object data or copy/move object
653
COPY       Copy object
654
MOVE       Move object
655
POST       Update object metadata/data
656
DELETE     Delete object
657
=========  =================================
658

    
659

    
660
HEAD
661
""""
662

    
663
====================  ================================
664
Request Header Name   Value
665
====================  ================================
666
If-Match              Retrieve if ETags match
667
If-None-Match         Retrieve if ETags don't match
668
If-Modified-Since     Retrieve if object has changed since provided timestamp
669
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
670
====================  ================================
671

    
672
|
673

    
674
======================  ===================================
675
Request Parameter Name  Value
676
======================  ===================================
677
version                 Optional version identifier
678
======================  ===================================
679

    
680
|
681

    
682
==========================  ===============================
683
Reply Header Name           Value
684
==========================  ===============================
685
ETag                        The ETag of the object
686
Content-Length              The size of the object
687
Content-Type                The MIME content type of the object
688
Last-Modified               The last object modification date (regardless of version)
689
Content-Encoding            The encoding of the object (optional)
690
Content-Disposition         The presentation style of the object (optional)
691
X-Object-Hash               The Merkle hash
692
X-Object-UUID               The object's UUID
693
X-Object-Version            The object's version identifier
694
X-Object-Version-Timestamp  The object's version timestamp
695
X-Object-Modified-By        The user that comitted the object's version
696
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
697
X-Object-Sharing            Object permissions (optional)
698
X-Object-Shared-By          Object inheriting permissions (optional)
699
X-Object-Allowed-To         Allowed actions on object (optional)
700
X-Object-Public             Object's publicly accessible URI (optional)
701
X-Object-Meta-*             Optional user defined metadata
702
==========================  ===============================
703

    
704
|
705

    
706
================  ===============================
707
Return Code       Description
708
================  ===============================
709
200 (No Content)  The request succeeded
710
================  ===============================
711

    
712

    
713
GET
714
"""
715

    
716
====================  ================================
717
Request Header Name   Value
718
====================  ================================
719
Range                 Optional range of data to retrieve
720
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
721
If-Match              Retrieve if ETags match
722
If-None-Match         Retrieve if ETags don't match
723
If-Modified-Since     Retrieve if object has changed since provided timestamp
724
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
725
====================  ================================
726

    
727
|
728

    
729
======================  ===================================
730
Request Parameter Name  Value
731
======================  ===================================
732
format                  Optional extended reply type (can be ``json`` or ``xml``)
733
hashmap                 Optional request for hashmap (no value parameter)
734
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
735
======================  ===================================
736

    
737
The reply is the object's data (or part of it), except if a hashmap is requested with ``hashmap``, or a version list with ``version=list`` (in both cases an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
738

    
739
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block. The ``X-Object-Hash`` header reports the single Merkle hash of the object's hashmap (refer to http://bittorrent.org/beps/bep_0030.html for more information).
740

    
741
Example ``format=json`` reply:
742

    
743
::
744

    
745
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
746

    
747
Example ``format=xml`` reply:
748

    
749
::
750

    
751
  <?xml version="1.0" encoding="UTF-8"?>
752
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
753
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
754
    <hash>...</hash>
755
  </object>
756

    
757
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
758

    
759
Example ``format=json`` reply:
760

    
761
::
762

    
763
  {"versions": [[85, "1322734861.248469"], [86, "1322734905.009272"], ...]}
764

    
765
Example ``format=xml`` reply:
766

    
767
::
768

    
769
  <?xml version="1.0" encoding="UTF-8"?>
770
  <object name="file">
771
    <version timestamp="1322734861.248469">85</version>
772
    <version timestamp="1322734905.009272">86</version>
773
    <version timestamp="...">...</version>
774
  </object>
775

    
776
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
777

    
778
==========================  ===============================
779
Reply Header Name           Value
780
==========================  ===============================
781
ETag                        The ETag of the object
782
Content-Length              The size of the data returned
783
Content-Type                The MIME content type of the object
784
Content-Range               The range of data included (only on a single range request)
785
Last-Modified               The last object modification date (regardless of version)
786
Content-Encoding            The encoding of the object (optional)
787
Content-Disposition         The presentation style of the object (optional)
788
X-Object-Hash               The Merkle hash
789
X-Object-UUID               The object's UUID
790
X-Object-Version            The object's version identifier
791
X-Object-Version-Timestamp  The object's version timestamp
792
X-Object-Modified-By        The user that comitted the object's version
793
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
794
X-Object-Sharing            Object permissions (optional)
795
X-Object-Shared-By          Object inheriting permissions (optional)
796
X-Object-Allowed-To         Allowed actions on object (optional)
797
X-Object-Public             Object's publicly accessible URI (optional)
798
X-Object-Meta-*             Optional user defined metadata
799
==========================  ===============================
800

    
801
Sharing headers (``X-Object-Sharing``, ``X-Object-Shared-By`` and ``X-Object-Allowed-To``) are only included if the request is for the object's latest version (no specific ``version`` parameter is set).
802

    
803
===========================  ==============================
804
Return Code                  Description
805
===========================  ==============================
806
200 (OK)                     The request succeeded
807
206 (Partial Content)        The range request succeeded
808
304 (Not Modified)           The object has not been modified
809
412 (Precondition Failed)    The condition set can not be satisfied
810
416 (Range Not Satisfiable)  The requested range is out of limits
811
===========================  ==============================
812

    
813

    
814
PUT
815
"""
816

    
817
====================  ================================
818
Request Header Name   Value
819
====================  ================================
820
If-Match              Put if ETags match with current object
821
If-None-Match         Put if ETags don't match with current object
822
ETag                  The MD5 hash of the object (optional to check written data)
823
Content-Length        The size of the data written
824
Content-Type          The MIME content type of the object
825
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
826
X-Copy-From           The source path in the form ``/<container>/<object>``
827
X-Move-From           The source path in the form ``/<container>/<object>``
828
X-Source-Account      The source account to copy/move from
829
X-Source-Version      The source version to copy from
830
Content-Encoding      The encoding of the object (optional)
831
Content-Disposition   The presentation style of the object (optional)
832
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
833
X-Object-Sharing      Object permissions (optional)
834
X-Object-Public       Object is publicly accessible (optional)
835
X-Object-Meta-*       Optional user defined metadata
836
====================  ================================
837

    
838
|
839

    
840
======================  ===================================
841
Request Parameter Name  Value
842
======================  ===================================
843
format                  Optional extended request type (can be ``json`` or ``xml``)
844
hashmap                 Optional hashmap provided instead of data (no value parameter)
845
======================  ===================================
846

    
847
The request is the object's data (or part of it), except if a hashmap is provided (using ``hashmap`` and ``format`` parameters). If using a hashmap and all different parts are stored in the server, the object is created, otherwise the server returns Conflict (409) with the list of the missing parts (in a simple text format, with one hash per line).
848

    
849
Hashmaps should be formatted as outlined in ``GET``.
850

    
851
==========================  ===============================
852
Reply Header Name           Value
853
==========================  ===============================
854
ETag                        The MD5 hash of the object
855
X-Object-Version            The object's new version
856
==========================  ===============================
857

    
858
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). Groups are specified as ``<account>:<group>``. To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
859

    
860
==============================  ==============================
861
Return Code                     Description
862
==============================  ==============================
863
201 (Created)                   The object has been created
864
409 (Conflict)                  The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a list of conflicting sharing paths will be included in the reply - in simple text format)
865
411 (Length Required)           Missing ``Content-Length`` or ``Content-Type`` in the request
866
413 (Request Entity Too Large)  Insufficient quota to complete the request
867
422 (Unprocessable Entity)      The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
868
==============================  ==============================
869

    
870

    
871
COPY
872
""""
873

    
874
======================  ===================================
875
Request Parameter Name  Value
876
======================  ===================================
877
ignore_content_type     Ignore the supplied Content-Type
878
======================  ===================================
879

    
880
|
881

    
882
====================  ================================
883
Request Header Name   Value
884
====================  ================================
885
If-Match              Proceed if ETags match with object
886
If-None-Match         Proceed if ETags don't match with object
887
Destination           The destination path in the form ``/<container>/<object>``
888
Destination-Account   The destination account to copy to
889
Content-Type          The MIME content type of the object (optional :sup:`*`)
890
Content-Encoding      The encoding of the object (optional)
891
Content-Disposition   The presentation style of the object (optional)
892
X-Source-Version      The source version to copy from
893
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
894
X-Object-Sharing      Object permissions (optional)
895
X-Object-Public       Object is publicly accessible (optional)
896
X-Object-Meta-*       Optional user defined metadata
897
====================  ================================
898

    
899
:sup:`*` *When using django locally with the supplied web server, use the ignore_content_type parameter, or do provide a valid Content-Type, as a type of text/plain is applied by default to all requests. Client software should always state ignore_content_type, except when a Content-Type is explicitly defined by the user.*
900

    
901
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
902

    
903
==========================  ===============================
904
Reply Header Name           Value
905
==========================  ===============================
906
X-Object-Version            The object's new version
907
==========================  ===============================
908

    
909
|
910

    
911
==============================  ==============================
912
Return Code                     Description
913
==============================  ==============================
914
201 (Created)                   The object has been created
915
409 (Conflict)                  There are conflicting permissions (a list of conflicting sharing paths will be included in the reply - in simple text format)
916
413 (Request Entity Too Large)  Insufficient quota to complete the request
917
==============================  ==============================
918

    
919

    
920
MOVE
921
""""
922

    
923
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
924

    
925

    
926
POST
927
""""
928

    
929
====================  ================================
930
Request Header Name   Value
931
====================  ================================
932
If-Match              Proceed if ETags match with object
933
If-None-Match         Proceed if ETags don't match with object
934
Content-Length        The size of the data written (optional, to update)
935
Content-Type          The MIME content type of the object (optional, to update)
936
Content-Range         The range of data supplied (optional, to update)
937
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
938
Content-Encoding      The encoding of the object (optional)
939
Content-Disposition   The presentation style of the object (optional)
940
X-Source-Object       Update with data from the object at path ``/<container>/<object>`` (optional, to update)
941
X-Source-Account      The source account to update from
942
X-Source-Version      The source version to update from (optional, to update)
943
X-Object-Bytes        The updated object's final size (optional, when updating)
944
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
945
X-Object-Sharing      Object permissions (optional)
946
X-Object-Public       Object is publicly accessible (optional)
947
X-Object-Meta-*       Optional user defined metadata
948
====================  ================================
949

    
950
|
951

    
952
======================  ============================================
953
Request Parameter Name  Value
954
======================  ============================================
955
update                  Do not replace metadata (no value parameter)
956
======================  ============================================
957

    
958
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
959

    
960
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
961

    
962
To update an object's data:
963

    
964
* Either set ``Content-Type`` to ``application/octet-stream``, or provide an object with ``X-Source-Object``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
965
* If the data is supplied in the request (using ``Content-Type`` instead of ``X-Source-Object``), a valid ``Content-Length`` header is required - except if using chunked transfers (set ``Transfer-Encoding`` to ``chunked``).
966
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
967

    
968
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
969
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
970
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data will be appended to the object.
971

    
972
Optionally, truncate the updated object to the desired length with the ``X-Object-Bytes`` header.
973

    
974
A data update will trigger an ETag change. Updated ETags may happen asynchronously and appear at the server with a delay.
975

    
976
No reply content. No reply headers if only metadata is updated.
977

    
978
==========================  ===============================
979
Reply Header Name           Value
980
==========================  ===============================
981
ETag                        The new ETag of the object (data updated)
982
X-Object-Version            The object's new version
983
==========================  ===============================
984

    
985
|
986

    
987
==============================  ==============================
988
Return Code                     Description
989
==============================  ==============================
990
202 (Accepted)                  The request has been accepted (not a data update)
991
204 (No Content)                The request succeeded (data updated)
992
409 (Conflict)                  There are conflicting permissions (a list of conflicting sharing paths will be included in the reply - in simple text format)
993
411 (Length Required)           Missing ``Content-Length`` in the request
994
413 (Request Entity Too Large)  Insufficient quota to complete the request
995
416 (Range Not Satisfiable)     The supplied range is invalid
996
==============================  ==============================
997

    
998
The ``POST`` method can also be used for creating an object via a standard HTML form. If the request ``Content-Type`` is ``multipart/form-data``, none of the above headers will be processed. The form should have an ``X-Object-Data`` field, as in the following example. The token is passed as a request parameter. ::
999

    
1000
  <form method="post" action="https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt?X-Auth-Token=0000" enctype="multipart/form-data">
1001
    <input type="file" name="X-Object-Data">
1002
    <input type="submit">
1003
  </form>
1004

    
1005
This will create/override the object with the given name, as if using ``PUT``. The ``Content-Type`` of the object will be set to the value of the corresponding header sent in the part of the request containing the data (usually, automatically handled by the browser). Metadata, sharing and other object attributes can not be set this way.
1006

    
1007
==========================  ===============================
1008
Reply Header Name           Value
1009
==========================  ===============================
1010
ETag                        The MD5 hash of the object
1011
X-Object-Version            The object's new version
1012
==========================  ===============================
1013

    
1014
|
1015

    
1016
==============================  ==============================
1017
Return Code                     Description
1018
==============================  ==============================
1019
201 (Created)                   The object has been created
1020
413 (Request Entity Too Large)  Insufficient quota to complete the request
1021
==============================  ==============================
1022

    
1023

    
1024
DELETE
1025
""""""
1026

    
1027
======================  ===================================
1028
Request Parameter Name  Value
1029
======================  ===================================
1030
until                   Optional timestamp
1031
======================  ===================================
1032

    
1033
If ``until`` is defined, the object is "purged" up to that time (the history up to then is deleted).
1034

    
1035
No reply content/headers.
1036

    
1037
===========================  ==============================
1038
Return Code                  Description
1039
===========================  ==============================
1040
204 (No Content)             The request succeeded
1041
===========================  ==============================
1042

    
1043
Sharing and Public Objects
1044
^^^^^^^^^^^^^^^^^^^^^^^^^^
1045

    
1046
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
1047

    
1048
A user may ``GET`` another account or container. The result will include a limited reply, containing only the allowed containers or objects respectively. A top-level request with an authentication token, will return a list of allowed accounts, so the user can easily find out which other users share objects. The ``X-Object-Allowed-To`` header lists the actions allowed on an object, if it does not belong to the requesting user.
1049

    
1050
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden):
1051

    
1052
==========================  ===============================
1053
Reply Header Name           Value
1054
==========================  ===============================
1055
ETag                        The ETag of the object
1056
Content-Length              The size of the data returned
1057
Content-Type                The MIME content type of the object
1058
Content-Range               The range of data included (only on a single range request)
1059
Last-Modified               The last object modification date (regardless of version)
1060
Content-Encoding            The encoding of the object (optional)
1061
Content-Disposition         The presentation style of the object (optional)
1062
==========================  ===============================
1063

    
1064
Public objects are not included and do not influence cross-user listings. They are, however, readable by all users.
1065

    
1066
Summary
1067
^^^^^^^
1068

    
1069
List of differences from the OOS API:
1070

    
1071
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
1072
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
1073
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings. (**TBD**)
1074
* Account and container policies to manage behavior and limits. Container behavior overrides account settings. Account quota sets the maximum bytes limit, regardless of container values.
1075
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
1076
* All metadata replies, at all levels, include latest modification information.
1077
* At all levels, a ``HEAD`` or ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
1078
* Container/object lists include all associated metadata if the reply is of type JSON/XML. Some names are kept to their OOS API equivalents for compatibility.
1079
* Option to include only shared containers/objects in listings.
1080
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
1081
* Multi-range object ``GET`` support as outlined in RFC2616.
1082
* Object hashmap retrieval through ``GET`` and the ``format`` parameter.
1083
* Object create via hashmap through ``PUT`` and the ``format`` parameter.
1084
* The object's Merkle hash is always returned in the ``X-Object-Hash`` header.
1085
* The object's UUID is always returned in the ``X-Object-UUID`` header. The UUID remains unchanged, even when the object's data or metadata changes, or the object is moved to another path (is renamed). A new UUID is assigned when creating or copying an object.
1086
* Object create using ``POST`` to support standard HTML forms.
1087
* Partial object updates through ``POST``, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers. Use another object's data to update with ``X-Source-Object`` and ``X-Source-Version``. Truncate with ``X-Object-Bytes``.
1088
* Include new version identifier in replies for object replace/change requests.
1089
* Object ``MOVE`` support and ``ignore_content_type`` parameter in both ``COPY`` and ``MOVE``.
1090
* Conditional object create/update operations, using ``If-Match`` and ``If-None-Match`` headers.
1091
* Time-variant account/container listings via the ``until`` parameter.
1092
* Object versions - parameter ``version`` in ``HEAD``/``GET`` (list versions with ``GET``), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in ``PUT``/``COPY``.
1093
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Cross-user operations are allowed - controlled by sharing directives. Available actions in cross-user requests are reported with ``X-Object-Allowed-To``. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
1094
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
1095
* Copy and move between accounts with ``X-Source-Account`` and ``Destination-Account`` headers.
1096
* Large object support with ``X-Object-Manifest``.
1097
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
1098
* Purge container/object history with the ``until`` parameter in ``DELETE``.
1099

    
1100
Clarifications/suggestions:
1101

    
1102
* All non-ASCII characters in headers should be URL-encoded.
1103
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
1104
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
1105
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
1106
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
1107
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired request/reply format. The parameter overrides the header.
1108
* Container/object lists use a ``200`` return code if the reply is of type JSON/XML. The reply will include an empty JSON/XML.
1109
* In headers, dates are formatted according to RFC 1123. In extended information listings, the ``last_modified`` field is formatted according to ISO 8601 (for OOS API compatibility). All other fields (Pithos extensions) use integer tiemstamps.
1110
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
1111
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
1112
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
1113

    
1114
The Pithos Client
1115
-----------------
1116

    
1117
User Experience
1118
^^^^^^^^^^^^^^^
1119

    
1120
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
1121

    
1122
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
1123

    
1124
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
1125
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
1126
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
1127
* The ``others`` element, which contains all objects that other users share with the user.
1128
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
1129
* The ``history`` element, which allows browsing past instances of ``home`` and - optionally - ``trash``.
1130

    
1131
Objects in Pithos can be:
1132

    
1133
* Moved to trash and then deleted.
1134
* Shared with specific permissions.
1135
* Made public (shared with non-Pithos users).
1136
* Restored from previous versions.
1137

    
1138
Some of these functions are performed by the client software and some by the Pithos server.
1139

    
1140
In the first version of Pithos, objects could also be assigned custom tags. This is no longer supported. Existing deployments can migrate tags into a specific metadata value, i.e. ``X-Object-Meta-Tags``.
1141

    
1142
Implementation Guidelines
1143
^^^^^^^^^^^^^^^^^^^^^^^^^
1144

    
1145
Pithos clients should use the ``pithos`` and ``trash`` containers for active and inactive objects respectively. If any of these containers is not found, the client software should create it, without interrupting the user's workflow. The ``home`` element corresponds to ``pithos`` and the ``trash`` element to ``trash``. Use ``PUT`` with the ``X-Move-From`` header, or ``MOVE`` to transfer objects from one container to the other. Use ``DELETE`` to remove from ``pithos`` without trashing, or to remove from ``trash``. When moving objects, detect naming conflicts with the ``If-Match`` or ``If-None-Match`` headers. Such conflicts should be resolved by the user.
1146

    
1147
Object names should use the ``/`` delimiter to impose a hierarchy of folders and files.
1148

    
1149
The ``shared`` element should be implemented as a read-only view of the ``pithos`` container, using the ``shared`` parameter when listing objects. The ``others`` element, should start with a top-level ``GET`` to retrieve the list of accounts accessible to the user. It is suggested that the client software hides the next step of navigation - the container - if it only includes ``pithos`` and forwards the user directly to the objects.
1150

    
1151
Public objects are not included in ``shared`` and ``others`` listings. It is suggested that they are marked in a visually distinctive way in ``pithos`` listings (for example using an icon overlay).
1152

    
1153
A special application menu, or a section in application preferences, should be devoted to managing groups (the ``groups`` element). All group-related actions are implemented at the account level.
1154

    
1155
Browsing past versions of objects should be available both at the object and the container level. At the object level, a list of past versions can be included in the screen showing details or more information on the object (metadata, permissions, etc.). At the container level, it is suggested that clients use a ``history`` element, which presents to the user a read-only, time-variable view of ``pithos`` contents. This can be accomplished via the ``until`` parameter in listings. Optionally, ``history`` may include ``trash``.
1156

    
1157
Uploading and downloading data
1158
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1159

    
1160
By using hashmaps to upload and download objects the corresponding operations can complete much faster.
1161

    
1162
In the case of an upload, only the missing blocks will be submitted to the server:
1163

    
1164
* Calculate the hash value for each block of the object to be uploaded. Use the hash algorithm and block size of the destination container.
1165
* Send a hashmap ``PUT`` request for the object.
1166

    
1167
  * Server responds with status ``201`` (Created):
1168

    
1169
    * Blocks are already on the server. The object has been created. Done.
1170

    
1171
  * Server responds with status ``409`` (Conflict):
1172

    
1173
    * Server's response body contains the hashes of the blocks that do not exist on the server.
1174
    * For each hash value in the server's response (or all hashes together):
1175

    
1176
      * Send a ``POST`` request to the destination container with the corresponding data.
1177

    
1178
* Repeat hashmap ``PUT``. Fail if the server's response is not ``201``.
1179

    
1180
Consulting hashmaps when downloading allows for resuming partially transferred objects. The client should retrieve the hashmap from the server and compare it with the hashmap computed from the respective local file. Any missing parts can be downloaded with ``GET`` requests with the additional ``Range`` header.
1181

    
1182
Syncing
1183
^^^^^^^
1184

    
1185
Consider the following algorithm for synchronizing a local folder with the server. The "state" is the complete object listing, with the corresponding attributes.
1186
 
1187
::
1188

    
1189
  L: local state (stored state from last sync with the server)
1190
  C: current state (state computed right before sync)
1191
  S: server state
1192

    
1193
  if C == L:
1194
      # No local changes
1195
      if S == L:
1196
          # No remote changes, nothing to do
1197
      else:
1198
          # Update local state to match that of the server
1199
         L = S
1200
  else:
1201
      # We have local changes
1202
      if S == L:
1203
          # No remote changes, update the server
1204
          S = C
1205
          L = S
1206
      else:
1207
          # Both we and server have changes
1208
          if C == S:
1209
              # We were lucky, we did the same change
1210
              L = S
1211
          else:
1212
              # We have conflicting changes
1213
              resolve conflict
1214

    
1215
Notes:
1216

    
1217
* States represent file hashes (it is suggested to use Merkle). Deleted or non-existing files are assumed to have a magic hash (e.g. empty string).
1218
* Updating a state (either local or remote) implies downloading, uploading or deleting the appropriate file.
1219

    
1220
Recommended Practices and Examples
1221
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1222

    
1223
Assuming an authentication token is obtained, the following high-level operations are available - shown with ``curl``:
1224

    
1225
* Get account information ::
1226

    
1227
    curl -X HEAD -D - \
1228
         -H "X-Auth-Token: 0000" \
1229
         https://pithos.dev.grnet.gr/v1/user
1230

    
1231
* List available containers ::
1232

    
1233
    curl -X GET -D - \
1234
         -H "X-Auth-Token: 0000" \
1235
         https://pithos.dev.grnet.gr/v1/user
1236

    
1237
* Get container information ::
1238

    
1239
    curl -X HEAD -D - \
1240
         -H "X-Auth-Token: 0000" \
1241
         https://pithos.dev.grnet.gr/v1/user/pithos
1242

    
1243
* Add a new container ::
1244

    
1245
    curl -X PUT -D - \
1246
         -H "X-Auth-Token: 0000" \
1247
         https://pithos.dev.grnet.gr/v1/user/test
1248

    
1249
* Delete a container ::
1250

    
1251
    curl -X DELETE -D - \
1252
         -H "X-Auth-Token: 0000" \
1253
         https://pithos.dev.grnet.gr/v1/user/test
1254

    
1255
* List objects in a container ::
1256

    
1257
    curl -X GET -D - \
1258
         -H "X-Auth-Token: 0000" \
1259
         https://pithos.dev.grnet.gr/v1/user/pithos
1260

    
1261
* List objects in a container (extended reply) ::
1262

    
1263
    curl -X GET -D - \
1264
         -H "X-Auth-Token: 0000" \
1265
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
1266

    
1267
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
1268

    
1269
* List metadata keys used by objects in a container
1270

    
1271
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``). (**TBD**)
1272

    
1273
* List objects in a container having a specific meta defined ::
1274

    
1275
    curl -X GET -D - \
1276
         -H "X-Auth-Token: 0000" \
1277
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=favorites
1278

    
1279
* Retrieve an object ::
1280

    
1281
    curl -X GET -D - \
1282
         -H "X-Auth-Token: 0000" \
1283
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
1284

    
1285
* Retrieve an object (specific ranges of data) ::
1286

    
1287
    curl -X GET -D - \
1288
         -H "X-Auth-Token: 0000" \
1289
         -H "Range: bytes=0-9" \
1290
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
1291

    
1292
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
1293

    
1294
* Add a new object (folder type) (**TBD**) ::
1295

    
1296
    curl -X PUT -D - \
1297
         -H "X-Auth-Token: 0000" \
1298
         -H "Content-Type: application/directory" \
1299
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
1300

    
1301
* Add a new object ::
1302

    
1303
    curl -X PUT -D - \
1304
         -H "X-Auth-Token: 0000" \
1305
         -H "Content-Type: text/plain" \
1306
         -T EXAMPLE.txt
1307
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
1308

    
1309
* Update an object ::
1310

    
1311
    curl -X POST -D - \
1312
         -H "X-Auth-Token: 0000" \
1313
         -H "Content-Length: 10" \
1314
         -H "Content-Type: application/octet-stream" \
1315
         -H "Content-Range: bytes 10-19/*" \
1316
         -d "0123456789" \
1317
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1318

    
1319
  This will update bytes 10-19 with the data specified.
1320

    
1321
* Update an object (append) ::
1322

    
1323
    curl -X POST -D - \
1324
         -H "X-Auth-Token: 0000" \
1325
         -H "Content-Length: 10" \
1326
         -H "Content-Type: application/octet-stream" \
1327
         -H "Content-Range: bytes */*" \
1328
         -d "0123456789" \
1329
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1330

    
1331
* Update an object (truncate) ::
1332

    
1333
    curl -X POST -D - \
1334
         -H "X-Auth-Token: 0000" \
1335
         -H "X-Source-Object: /folder/EXAMPLE.txt" \
1336
         -H "Content-Range: bytes 0-0/*" \
1337
         -H "X-Object-Bytes: 0" \
1338
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1339

    
1340
  This will truncate the object to 0 bytes.
1341

    
1342
* Add object metadata ::
1343

    
1344
    curl -X POST -D - \
1345
         -H "X-Auth-Token: 0000" \
1346
         -H "X-Object-Meta-First: first_meta_value" \
1347
         -H "X-Object-Meta-Second: second_meta_value" \
1348
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1349

    
1350
* Delete object metadata ::
1351

    
1352
    curl -X POST -D - \
1353
         -H "X-Auth-Token: 0000" \
1354
         -H "X-Object-Meta-First: first_meta_value" \
1355
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1356

    
1357
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
1358

    
1359
* Delete an object ::
1360

    
1361
    curl -X DELETE -D - \
1362
         -H "X-Auth-Token: 0000" \
1363
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt