Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ 1993fea9

History | View | Annotate | Download (46.3 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
Document Revisions
23
^^^^^^^^^^^^^^^^^^
24

    
25
=========================  ================================
26
Revision                   Description
27
=========================  ================================
28
0.4 (June 30, 2011)        Object permissions and account groups.
29
\                          Control versioning behavior and container quotas with container policy directives.
30
\                          Support updating/deleting individual metadata with ``POST``.
31
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
32
\                          Allow for publicly available objects via ``https://hostname/public``.
33
\                          Support time-variant account/container listings. 
34
\                          Add source version when duplicating with PUT/COPY.
35
\                          Request version in object HEAD/GET requests (list versions with GET).
36
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
37
\                          Include underlying storage characteristics in container meta.
38
\                          Support for partial object updates through POST.
39
\                          Expose object hashmaps through GET.
40
\                          Support for multi-range object GET requests.
41
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
42
=========================  ================================
43

    
44
The Pithos API
45
--------------
46

    
47
The URI requests supported by the Pithos API follow one of the following forms:
48

    
49
* Top level: ``https://hostname/v1/``
50
* Account level: ``https://hostname/v1/<account>``
51
* Container level: ``https://hostname/v1/<account>/<container>``
52
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
53

    
54
All requests must include an ``X-Auth-Token``. The process of obtaining the token is still to be determined (**TBD**).
55

    
56
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
57

    
58
=========================  ================================
59
Return Code                Description
60
=========================  ================================
61
400 (Bad Request)          The request is invalid
62
401 (Unauthorized)         Request not allowed
63
404 (Not Found)            The requested resource was not found
64
503 (Service Unavailable)  The request cannot be completed because of an internal error
65
=========================  ================================
66

    
67
Top Level
68
^^^^^^^^^
69

    
70
List of operations:
71

    
72
=========  ==================
73
Operation  Description
74
=========  ==================
75
GET        Authentication. This is kept for compatibility with the OOS API
76
=========  ==================
77

    
78
GET
79
"""
80

    
81
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
82

    
83
================  =====================
84
Return Code       Description
85
================  =====================
86
204 (No Content)  The request succeeded
87
================  =====================
88

    
89

    
90
Account Level
91
^^^^^^^^^^^^^
92

    
93
List of operations:
94

    
95
=========  ==================
96
Operation  Description
97
=========  ==================
98
HEAD       Retrieve account metadata
99
GET        List containers
100
POST       Update account metadata
101
=========  ==================
102

    
103
HEAD
104
""""
105

    
106
======================  ===================================
107
Request Parameter Name  Value
108
======================  ===================================
109
until                   Optional timestamp
110
======================  ===================================
111

    
112
|
113

    
114
==========================  =====================
115
Reply Header Name           Value
116
==========================  =====================
117
X-Account-Container-Count   The total number of containers
118
X-Account-Object-Count      The total number of objects (**TBD**)
119
X-Account-Bytes-Used        The total number of bytes stored
120
X-Account-Bytes-Remaining   The total number of bytes remaining (**TBD**)
121
X-Account-Last-Login        The last login (**TBD**)
122
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
123
X-Account-Group-*           Optional user defined groups
124
X-Account-Meta-*            Optional user defined metadata
125
Last-Modified               The last account modification date (regardless of ``until``)
126
==========================  =====================
127

    
128
|
129

    
130
================  =====================
131
Return Code       Description
132
================  =====================
133
204 (No Content)  The request succeeded
134
================  =====================
135

    
136

    
137
GET
138
"""
139

    
140
====================  ===========================
141
Request Header Name   Value
142
====================  ===========================
143
If-Modified-Since     Retrieve if account has changed since provided timestamp
144
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
145
====================  ===========================
146

    
147
|
148

    
149
======================  =========================
150
Request Parameter Name  Value
151
======================  =========================
152
limit                   The amount of results requested (default is 10000)
153
marker                  Return containers with name lexicographically after marker
154
format                  Optional extended reply type (can be ``json`` or ``xml``)
155
until                   Optional timestamp
156
======================  =========================
157

    
158
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
159
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
160
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
161

    
162
===========================  ============================
163
Name                         Description
164
===========================  ============================
165
name                         The name of the container
166
count                        The number of objects inside the container
167
bytes                        The total size of the objects inside the container
168
last_modified                The last container modification date (regardless of ``until``)
169
x_container_until_timestamp  The last container modification date until the timestamp provided
170
x_container_policy_*         Container behavior and limits
171
x_container_meta_*           Optional user defined metadata
172
===========================  ============================
173

    
174
For examples of container details returned in JSON/XML formats refer to the OOS API documentation.
175

    
176
===========================  =====================
177
Return Code                  Description
178
===========================  =====================
179
200 (OK)                     The request succeeded
180
204 (No Content)             The account has no containers (only for non-extended replies)
181
304 (Not Modified)           The account has not been modified
182
412 (Precondition Failed)    The condition set can not be satisfied
183
===========================  =====================
184

    
185
Will use a ``200`` return code if the reply is of type json/xml.
186

    
187

    
188
POST
189
""""
190

    
191
======================  ============================================
192
Request Parameter Name  Value
193
======================  ============================================
194
update                  Do not replace metadata/groups (no value parameter)
195
======================  ============================================
196

    
197
|
198

    
199
====================  ===========================
200
Request Header Name   Value
201
====================  ===========================
202
X-Account-Group-*     Optional user defined groups
203
X-Account-Meta-*      Optional user defined metadata
204
====================  ===========================
205

    
206
No reply content/headers.
207

    
208
The operation will overwrite all user defined metadata, except if ``update`` is defined.
209
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
210

    
211
================  ===============================
212
Return Code       Description
213
================  ===============================
214
202 (Accepted)    The request has been accepted
215
================  ===============================
216

    
217

    
218
Container Level
219
^^^^^^^^^^^^^^^
220

    
221
List of operations:
222

    
223
=========  ============================
224
Operation  Description
225
=========  ============================
226
HEAD       Retrieve container metadata
227
GET        List objects
228
PUT        Create/update container
229
POST       Update container metadata
230
DELETE     Delete container
231
=========  ============================
232

    
233

    
234
HEAD
235
""""
236

    
237
======================  ===================================
238
Request Parameter Name  Value
239
======================  ===================================
240
until                   Optional timestamp
241
======================  ===================================
242

    
243
|
244

    
245
===========================  ===============================
246
Reply Header Name            Value
247
===========================  ===============================
248
X-Container-Object-Count     The total number of objects in the container
249
X-Container-Bytes-Used       The total number of bytes of all objects stored
250
X-Container-Block-Size       The block size used by the storage backend
251
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
252
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
253
X-Container-Object-Meta      A list with all meta keys used by objects
254
X-Container-Policy-*         Container behavior and limits
255
X-Container-Meta-*           Optional user defined metadata
256
Last-Modified                The last container modification date (regardless of ``until``)
257
===========================  ===============================
258

    
259
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix. See container ``PUT`` for a reference of policy directives.
260

    
261
================  ===============================
262
Return Code       Description
263
================  ===============================
264
204 (No Content)  The request succeeded
265
================  ===============================
266

    
267

    
268
GET
269
"""
270

    
271
====================  ===========================
272
Request Header Name   Value
273
====================  ===========================
274
If-Modified-Since     Retrieve if container has changed since provided timestamp
275
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
276
====================  ===========================
277

    
278
|
279

    
280
======================  ===================================
281
Request Parameter Name  Value
282
======================  ===================================
283
limit                   The amount of results requested (default is 10000)
284
marker                  Return containers with name lexicographically after marker
285
prefix                  Return objects starting with prefix
286
delimiter               Return objects up to the delimiter (discussion follows)
287
path                    Assume ``prefix=path`` and ``delimiter=/``
288
format                  Optional extended reply type (can be ``json`` or ``xml``)
289
meta                    Return objects having the specified meta keys (can be a comma separated list)
290
until                   Optional timestamp
291
======================  ===================================
292

    
293
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
294

    
295
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
296

    
297
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
298
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
299
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
300

    
301
==========================  ======================================
302
Name                        Description
303
==========================  ======================================
304
name                        The name of the object
305
hash                        The ETag of the object
306
bytes                       The size of the object
307
content_type                The MIME content type of the object
308
content_encoding            The encoding of the object (optional)
309
content-disposition         The presentation style of the object (optional)
310
last_modified               The last object modification date (regardless of version)
311
x_object_version            The object's version identifier
312
x_object_version_timestamp  The object's version timestamp
313
x_object_modified_by        The user that committed the object's version
314
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
315
x_object_sharing            Object permissions (optional)
316
x_object_shared_by          Object inheriting permissions (optional)
317
x_object_public             Object's publicly accessible URI (optional)
318
x_object_meta_*             Optional user defined metadata
319
==========================  ======================================
320

    
321
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
322
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
323
In JSON results they appear as dictionaries with only a ``"subdir"`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
324
In case there is an object with the same name as a virtual directory marker, the object will be returned.
325

    
326
For examples of object details returned in JSON/XML formats refer to the OOS API documentation.
327

    
328
===========================  ===============================
329
Return Code                  Description
330
===========================  ===============================
331
200 (OK)                     The request succeeded
332
204 (No Content)             The account has no containers (only for non-extended replies)
333
304 (Not Modified)           The container has not been modified
334
412 (Precondition Failed)    The condition set can not be satisfied
335
===========================  ===============================
336

    
337
Will use a ``200`` return code if the reply is of type json/xml.
338

    
339

    
340
PUT
341
"""
342

    
343
====================  ================================
344
Request Header Name   Value
345
====================  ================================
346
X-Container-Policy-*  Container behavior and limits
347
X-Container-Meta-*    Optional user defined metadata
348
====================  ================================
349
 
350
No reply content/headers.
351

    
352
If no policy is defined, the container will be created with the default values.
353
Available policy directives:
354

    
355
* ``versioning``: Set to ``auto``, ``manual`` or ``none`` (default is ``manual``)
356
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
357
 
358
================  ===============================
359
Return Code       Description
360
================  ===============================
361
201 (Created)     The container has been created
362
202 (Accepted)    The request has been accepted
363
================  ===============================
364

    
365

    
366
POST
367
""""
368

    
369
======================  ============================================
370
Request Parameter Name  Value
371
======================  ============================================
372
update                  Do not replace metadata/policy (no value parameter)
373
======================  ============================================
374

    
375
|
376

    
377
====================  ================================
378
Request Header Name   Value
379
====================  ================================
380
X-Container-Policy-*  Container behavior and limits
381
X-Container-Meta-*    Optional user defined metadata
382
====================  ================================
383

    
384
No reply content/headers.
385

    
386
The operation will overwrite all user defined metadata, except if ``update`` is defined.
387
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
388

    
389
================  ===============================
390
Return Code       Description
391
================  ===============================
392
202 (Accepted)    The request has been accepted
393
================  ===============================
394

    
395

    
396
DELETE
397
""""""
398

    
399
No request parameters/headers.
400

    
401
No reply content/headers.
402

    
403
================  ===============================
404
Return Code       Description
405
================  ===============================
406
204 (No Content)  The request succeeded
407
409 (Conflict)    The container is not empty
408
================  ===============================
409

    
410

    
411
Object Level
412
^^^^^^^^^^^^
413

    
414
List of operations:
415

    
416
=========  =================================
417
Operation  Description
418
=========  =================================
419
HEAD       Retrieve object metadata
420
GET        Read object data
421
PUT        Write object data or copy/move object
422
COPY       Copy object
423
MOVE       Move object
424
POST       Update object metadata/data
425
DELETE     Delete object
426
=========  =================================
427

    
428

    
429
HEAD
430
""""
431

    
432
======================  ===================================
433
Request Parameter Name  Value
434
======================  ===================================
435
version                 Optional version identifier
436
======================  ===================================
437

    
438
|
439

    
440
==========================  ===============================
441
Reply Header Name           Value
442
==========================  ===============================
443
ETag                        The ETag of the object
444
Content-Length              The size of the object
445
Content-Type                The MIME content type of the object
446
Last-Modified               The last object modification date (regardless of version)
447
Content-Encoding            The encoding of the object (optional)
448
Content-Disposition         The presentation style of the object (optional)
449
X-Object-Version            The object's version identifier
450
X-Object-Version-Timestamp  The object's version timestamp
451
X-Object-Modified-By        The user that comitted the object's version
452
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
453
X-Object-Sharing            Object permissions (optional)
454
X-Object-Shared-By          Object inheriting permissions (optional)
455
X-Object-Public             Object's publicly accessible URI (optional)
456
X-Object-Meta-*             Optional user defined metadata
457
==========================  ===============================
458

    
459
|
460

    
461
================  ===============================
462
Return Code       Description
463
================  ===============================
464
200 (No Content)  The request succeeded
465
================  ===============================
466

    
467

    
468
GET
469
"""
470

    
471
====================  ================================
472
Request Header Name   Value
473
====================  ================================
474
Range                 Optional range of data to retrieve
475
If-Match              Retrieve if ETags match
476
If-None-Match         Retrieve if ETags don't match
477
If-Modified-Since     Retrieve if object has changed since provided timestamp
478
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
479
====================  ================================
480

    
481
|
482

    
483
======================  ===================================
484
Request Parameter Name  Value
485
======================  ===================================
486
format                  Optional extended reply type (can be ``json`` or ``xml``)
487
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
488
======================  ===================================
489

    
490
The reply is the object's data (or part of it), except if a hashmap is requested with the ``format`` parameter, or a version list with ``version=list`` (in which case an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
491

    
492
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block.
493

    
494
Example ``format=json`` reply:
495

    
496
::
497

    
498
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
499

    
500
Example ``format=xml`` reply:
501

    
502
::
503

    
504
  <?xml version="1.0" encoding="UTF-8"?>
505
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
506
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
507
    <hash>...</hash>
508
  </object>
509

    
510
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
511

    
512
Example ``format=json`` reply:
513

    
514
::
515

    
516
  {"versions": [[23, 1307700892], [28, 1307700898], ...]}
517

    
518
Example ``format=xml`` reply:
519

    
520
::
521

    
522
  <?xml version="1.0" encoding="UTF-8"?>
523
  <object name="file">
524
    <version timestamp="1307700892">23</version>
525
    <version timestamp="1307700898">28</version>
526
    <version timestamp="...">...</version>
527
  </object>
528

    
529
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
530

    
531
==========================  ===============================
532
Reply Header Name           Value
533
==========================  ===============================
534
ETag                        The ETag of the object
535
Content-Length              The size of the data returned
536
Content-Type                The MIME content type of the object
537
Content-Range               The range of data included (only on a single range request)
538
Last-Modified               The last object modification date (regardless of version)
539
Content-Encoding            The encoding of the object (optional)
540
Content-Disposition         The presentation style of the object (optional)
541
X-Object-Version            The object's version identifier
542
X-Object-Version-Timestamp  The object's version timestamp
543
X-Object-Modified-By        The user that comitted the object's version
544
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
545
X-Object-Sharing            Object permissions (optional)
546
X-Object-Shared-By          Object inheriting permissions (optional)
547
X-Object-Public             Object's publicly accessible URI (optional)
548
X-Object-Meta-*             Optional user defined metadata
549
==========================  ===============================
550

    
551
|
552

    
553
===========================  ==============================
554
Return Code                  Description
555
===========================  ==============================
556
200 (OK)                     The request succeeded
557
206 (Partial Content)        The range request succeeded
558
304 (Not Modified)           The object has not been modified
559
412 (Precondition Failed)    The condition set can not be satisfied
560
416 (Range Not Satisfiable)  The requested range is out of limits
561
===========================  ==============================
562

    
563

    
564
PUT
565
"""
566

    
567
====================  ================================
568
Request Header Name   Value
569
====================  ================================
570
ETag                  The MD5 hash of the object (optional to check written data)
571
Content-Length        The size of the data written
572
Content-Type          The MIME content type of the object
573
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
574
X-Copy-From           The source path in the form ``/<container>/<object>``
575
X-Move-From           The source path in the form ``/<container>/<object>``
576
X-Source-Version      The source version to copy from
577
Content-Encoding      The encoding of the object (optional)
578
Content-Disposition   The presentation style of the object (optional)
579
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
580
X-Object-Sharing      Object permissions (optional)
581
X-Object-Public       Object is publicly accessible (optional)
582
X-Object-Meta-*       Optional user defined metadata
583
====================  ================================
584

    
585
|
586

    
587
==========================  ===============================
588
Reply Header Name           Value
589
==========================  ===============================
590
ETag                        The MD5 hash of the object (on create)
591
==========================  ===============================
592

    
593
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
594

    
595
===========================  ==============================
596
Return Code                  Description
597
===========================  ==============================
598
201 (Created)                The object has been created
599
409 (Conflict)               The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a conflicting sharing path will be included in the reply - in JSON format)
600
411 (Length Required)        Missing ``Content-Length`` or ``Content-Type`` in the request
601
422 (Unprocessable Entity)   The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
602
===========================  ==============================
603

    
604

    
605
COPY
606
""""
607

    
608
====================  ================================
609
Request Header Name   Value
610
====================  ================================
611
Destination           The destination path in the form ``/<container>/<object>``
612
Content-Type          The MIME content type of the object (optional)
613
Content-Encoding      The encoding of the object (optional)
614
Content-Disposition   The presentation style of the object (optional)
615
X-Source-Version      The source version to copy from
616
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
617
X-Object-Sharing      Object permissions (optional)
618
X-Object-Public       Object is publicly accessible (optional)
619
X-Object-Meta-*       Optional user defined metadata
620
====================  ================================
621

    
622
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
623

    
624
No reply content/headers.
625

    
626
===========================  ==============================
627
Return Code                  Description
628
===========================  ==============================
629
201 (Created)                The object has been created
630
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
631
===========================  ==============================
632

    
633

    
634
MOVE
635
""""
636

    
637
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
638

    
639

    
640
POST
641
""""
642

    
643
======================  ============================================
644
Request Parameter Name  Value
645
======================  ============================================
646
update                  Do not replace metadata (no value parameter)
647
======================  ============================================
648

    
649
|
650

    
651
====================  ================================
652
Request Header Name   Value
653
====================  ================================
654
Content-Length        The size of the data written (optional, to update)
655
Content-Type          The MIME content type of the object (optional, to update)
656
Content-Range         The range of data supplied (optional, to update)
657
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
658
Content-Encoding      The encoding of the object (optional)
659
Content-Disposition   The presentation style of the object (optional)
660
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
661
X-Object-Sharing      Object permissions (optional)
662
X-Object-Public       Object is publicly accessible (optional)
663
X-Object-Meta-*       Optional user defined metadata
664
====================  ================================
665

    
666
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
667

    
668
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
669

    
670
To update an object's data:
671

    
672
* Set ``Content-Type`` to ``application/octet-stream``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
673
* Supply ``Content-Length`` (except if using chunked transfers), ``Content-Type`` and ``Content-Range`` headers.
674
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
675

    
676
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
677
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
678
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data supplied will be appended to the object.
679

    
680
A data update will trigger an ETag change. The new ETag will not correspond to the object's MD5 sum (**TBD**) and will be included in reply headers.
681

    
682
No reply content. No reply headers if only metadata is updated.
683

    
684
==========================  ===============================
685
Reply Header Name           Value
686
==========================  ===============================
687
ETag                        The new ETag of the object (data updated)
688
==========================  ===============================
689

    
690
|
691

    
692
===========================  ==============================
693
Return Code                  Description
694
===========================  ==============================
695
202 (Accepted)               The request has been accepted (not a data update)
696
204 (No Content)             The request succeeded (data updated)
697
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
698
411 (Length Required)        Missing ``Content-Length`` in the request
699
416 (Range Not Satisfiable)  The supplied range is invalid
700
===========================  ==============================
701

    
702

    
703
DELETE
704
""""""
705

    
706
No request parameters/headers.
707

    
708
No reply content/headers.
709

    
710
===========================  ==============================
711
Return Code                  Description
712
===========================  ==============================
713
204 (No Content)             The request succeeded
714
===========================  ==============================
715

    
716
Sharing and Public Objects
717
^^^^^^^^^^^^^^^^^^^^^^^^^^
718

    
719
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
720

    
721
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden).
722

    
723
==========================  ===============================
724
Reply Header Name           Value
725
==========================  ===============================
726
ETag                        The ETag of the object
727
Content-Length              The size of the data returned
728
Content-Type                The MIME content type of the object
729
Content-Range               The range of data included (only on a single range request)
730
Last-Modified               The last object modification date (regardless of version)
731
Content-Encoding            The encoding of the object (optional)
732
Content-Disposition         The presentation style of the object (optional)
733
==========================  ===============================
734

    
735
Summary
736
^^^^^^^
737

    
738
List of differences from the OOS API:
739

    
740
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
741
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
742
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings.
743
* Container policies to manage behavior and limits.
744
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
745
* All metadata replies, at all levels, include latest modification information.
746
* At all levels, a ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
747
* Container/object lists include all associated metadata if the reply is of type json/xml. Some names are kept to their OOS API equivalents for compatibility. 
748
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
749
* Multi-range object GET support as outlined in RFC2616.
750
* Object hashmap retrieval through GET and the ``format`` parameter.
751
* Partial object updates through POST, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers.
752
* Object ``MOVE`` support.
753
* Time-variant account/container listings via the ``until`` parameter.
754
* Object versions - parameter ``version`` in HEAD/GET (list versions with GET), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in PUT/COPY.
755
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
756
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
757
* Large object support with ``X-Object-Manifest``.
758
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
759

    
760
Clarifications/suggestions:
761

    
762
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
763
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
764
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
765
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
766
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired reply format. The parameter overrides the header.
767
* Container/object lists use a ``200`` return code if the reply is of type json/xml. The reply will include an empty json/xml.
768
* In headers, dates are formatted according to RFC 1123. In extended information listings, dates are formatted according to ISO 8601.
769
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
770
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
771
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
772

    
773
The Pithos Client
774
-----------------
775

    
776
User Experience
777
^^^^^^^^^^^^^^^
778

    
779
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
780

    
781
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
782

    
783
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
784
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
785
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
786
* The ``others`` element, which contains all objects that other users share with the user.
787
* The ``tags`` element, which lists the names of tags the user has defined. This can be an entry point to list all files that have been assigned a specific tag or manage tags in general (remove a tag completely, rename a tag etc.).
788
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
789

    
790
Objects in Pithos can be:
791

    
792
* Assigned custom tags.
793
* Moved to trash and then deleted.
794
* Shared with specific permissions.
795
* Made public (shared with non-Pithos users).
796
* Restored from previous versions.
797

    
798
Some of these functions are performed by the client software and some by the Pithos server. Client-driven functionality is based on specific metadata that should be handled equally across implementations. These metadata names are discussed in the next chapter. 
799

    
800
Conventions and Metadata Specification
801
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
802

    
803
Pithos clients should use the ``pithos`` container for all Pithos objects. Object names use the ``/`` delimiter to impose a hierarchy of folders and files.
804

    
805
At the object level, tags are implemented by managing metadata keys. The client software should allow the user to use any string as a tag (except ``trash``) and then set the corresponding ``X-Object-Meta-<tag>`` key at the server. The API extensions provided, allow for listing all tags in a container and filtering object listings based on one or more tags. The tag list is sufficient for implementing the ``tags`` element, either as a special, virtual folder (as done in the first version of Pithos), or as an application menu.
806

    
807
To manage the deletion of files use the same API and the ``X-Object-Meta-Trash`` key. The string ``trash`` can not be used as a tag. The ``trash`` element should be presented as a folder, although with no hierarchy.
808

    
809
The metadata specification is summarized in the following table.
810

    
811
===========================  ==============================
812
Metadata Name                Value
813
===========================  ==============================
814
X-Object-Meta-Trash          Set to ``true`` if the object has been moved to the trash
815
X-Object-Meta-*              Use for other tags that apply to the object
816
===========================  ==============================
817

    
818
Recommended Practices and Examples
819
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
820

    
821
Assuming an authentication token is obtained (**TBD**), the following high-level operations are available - shown with ``curl``:
822

    
823
* Get account information ::
824

    
825
    curl -X HEAD -D - \
826
         -H "X-Auth-Token: 0000" \
827
         https://pithos.dev.grnet.gr/v1/user
828

    
829
* List available containers ::
830

    
831
    curl -X GET -D - \
832
         -H "X-Auth-Token: 0000" \
833
         https://pithos.dev.grnet.gr/v1/user
834

    
835
* Get container information ::
836

    
837
    curl -X HEAD -D - \
838
         -H "X-Auth-Token: 0000" \
839
         https://pithos.dev.grnet.gr/v1/user/pithos
840

    
841
* Add a new container ::
842

    
843
    curl -X PUT -D - \
844
         -H "X-Auth-Token: 0000" \
845
         https://pithos.dev.grnet.gr/v1/user/test
846

    
847
* Delete a container ::
848

    
849
    curl -X DELETE -D - \
850
         -H "X-Auth-Token: 0000" \
851
         https://pithos.dev.grnet.gr/v1/user/test
852

    
853
* List objects in a container ::
854

    
855
    curl -X GET -D - \
856
         -H "X-Auth-Token: 0000" \
857
         https://pithos.dev.grnet.gr/v1/user/pithos
858

    
859
* List objects in a container (extended reply) ::
860

    
861
    curl -X GET -D - \
862
         -H "X-Auth-Token: 0000" \
863
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
864

    
865
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
866

    
867
* List metadata keys used by objects in a container
868

    
869
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``).
870

    
871
* List objects in a container having a specific meta defined ::
872

    
873
    curl -X GET -D - \
874
         -H "X-Auth-Token: 0000" \
875
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=trash
876

    
877
  This is the recommended way of tagging/retrieving objects in trash.
878

    
879
* Retrieve an object ::
880

    
881
    curl -X GET -D - \
882
         -H "X-Auth-Token: 0000" \
883
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
884

    
885
* Retrieve an object (specific ranges of data) ::
886

    
887
    curl -X GET -D - \
888
         -H "X-Auth-Token: 0000" \
889
         -H "Range: bytes=0-9" \
890
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
891

    
892
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
893

    
894
* Add a new object (folder type) (**TBD**) ::
895

    
896
    curl -X PUT -D - \
897
         -H "X-Auth-Token: 0000" \
898
         -H "Content-Type: application/folder" \
899
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
900

    
901
* Add a new object ::
902

    
903
    curl -X PUT -D - \
904
         -H "X-Auth-Token: 0000" \
905
         -H "Content-Type: text/plain" \
906
         -T EXAMPLE.txt
907
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
908

    
909
* Update an object ::
910

    
911
    curl -X POST -D - \
912
         -H "X-Auth-Token: 0000" \
913
         -H "Content-Length: 10" \
914
         -H "Content-Type: application/octet-stream" \
915
         -H "Content-Range: bytes 10-19/*" \
916
         -d "0123456789" \
917
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
918

    
919
  This will update bytes 10-19 with the data specified.
920

    
921
* Update an object (append) ::
922

    
923
    curl -X POST -D - \
924
         -H "X-Auth-Token: 0000" \
925
         -H "Content-Length: 10" \
926
         -H "Content-Type: application/octet-stream" \
927
         -H "Content-Range: bytes */*" \
928
         -d "0123456789" \
929
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
930

    
931
* Add object metadata ::
932

    
933
    curl -X POST -D - \
934
         -H "X-Auth-Token: 0000" \
935
         -H "X-Object-Meta-First: first_meta_value" \
936
         -H "X-Object-Meta-Second: second_meta_value" \
937
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
938

    
939
* Delete object metadata ::
940

    
941
    curl -X POST -D - \
942
         -H "X-Auth-Token: 0000" \
943
         -H "X-Object-Meta-First: first_meta_value" \
944
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
945

    
946
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
947

    
948
* Delete an object ::
949

    
950
    curl -X DELETE -D - \
951
         -H "X-Auth-Token: 0000" \
952
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
953