Pithos v2 Developer Guide
=========================

    
Introduction
------------

Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and the similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.

However, to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management allows taking account and container listings back in time, as well as reading previous instances of objects.

The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.

This document's goals are:

* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations

The present document is meant to be read alongside the OOS API documentation. The reader should therefore be familiar with the associated technologies, the OOS API, and the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.

Anything marked as to be determined (**TBD**) should not be considered by implementors.

    
Document Revisions
^^^^^^^^^^^^^^^^^^

=========================  ================================
Revision                   Description
=========================  ================================
0.4 (July 01, 2011)        Object permissions and account groups.
\                          Control versioning behavior and container quotas with container policy directives.
\                          Support updating/deleting individual metadata with ``POST``.
\                          Create object using hashmap.
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
\                          Allow for publicly available objects via ``https://hostname/public``.
\                          Support time-variant account/container listings.
\                          Add source version when duplicating with PUT/COPY.
\                          Request version in object HEAD/GET requests (list versions with GET).
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
\                          Include underlying storage characteristics in container meta.
\                          Support for partial object updates through POST.
\                          Expose object hashmaps through GET.
\                          Support for multi-range object GET requests.
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
=========================  ================================

    
The Pithos API
--------------

The URI requests supported by the Pithos API follow one of the following forms:

* Top level: ``https://hostname/v1/``
* Account level: ``https://hostname/v1/<account>``
* Container level: ``https://hostname/v1/<account>/<container>``
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
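
The four URI forms above can be produced by one small helper. The following is a minimal sketch, not part of the API: ``pithos_url`` and ``API_ROOT`` are hypothetical names, ``hostname`` is the placeholder used in this guide, and percent-encoding the path components is an assumption about what a client would want to do.

```python
from urllib.parse import quote

# Placeholder root, copied from the URI forms in this guide.
API_ROOT = "https://hostname/v1"


def pithos_url(account=None, container=None, obj=None):
    """Build a top, account, container or object level URI."""
    if obj is not None and container is None:
        raise ValueError("an object needs a container")
    if container is not None and account is None:
        raise ValueError("a container needs an account")
    if account is None:
        return API_ROOT + "/"
    parts = [quote(account, safe="")]
    if container is not None:
        parts.append(quote(container, safe=""))
    if obj is not None:
        # Assumption: "/" inside object names is kept as is, so that
        # names such as "docs/report.txt" stay readable in the path.
        parts.append(quote(obj, safe="/"))
    return API_ROOT + "/" + "/".join(parts)
```

For example, ``pithos_url("user", "pithos", "docs/a b.txt")`` yields the object level form with the space percent-encoded.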

    
All requests must include an ``X-Auth-Token``. The process of obtaining the token is still to be determined (**TBD**).

The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.

=========================  ================================
Return Code                Description
=========================  ================================
400 (Bad Request)          The request is invalid
401 (Unauthorized)         Request not allowed
404 (Not Found)            The requested resource was not found
503 (Service Unavailable)  The request cannot be completed because of an internal error
=========================  ================================

    
Top Level
^^^^^^^^^

List of operations:

=========  ==================
Operation  Description
=========  ==================
GET        Authentication. This is kept for compatibility with the OOS API
=========  ==================

GET
"""

If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, the reply will include a dummy ``X-Auth-Token`` and ``X-Storage-Url``, which can be used as a guest token/namespace for testing Pithos.

================  =====================
Return Code       Description
================  =====================
204 (No Content)  The request succeeded
================  =====================

    
Account Level
^^^^^^^^^^^^^

List of operations:

=========  ==================
Operation  Description
=========  ==================
HEAD       Retrieve account metadata
GET        List containers
POST       Update account metadata
=========  ==================

    
HEAD
""""

======================  ===================================
Request Parameter Name  Value
======================  ===================================
until                   Optional timestamp
======================  ===================================

|

==========================  =====================
Reply Header Name           Value
==========================  =====================
X-Account-Container-Count   The total number of containers
X-Account-Object-Count      The total number of objects (**TBD**)
X-Account-Bytes-Used        The total number of bytes stored
X-Account-Bytes-Remaining   The total number of bytes remaining (**TBD**)
X-Account-Last-Login        The last login (**TBD**)
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
X-Account-Group-*           Optional user defined groups
X-Account-Meta-*            Optional user defined metadata
Last-Modified               The last account modification date (regardless of ``until``)
==========================  =====================

|

================  =====================
Return Code       Description
================  =====================
204 (No Content)  The request succeeded
================  =====================

    
GET
"""

====================  ===========================
Request Header Name   Value
====================  ===========================
If-Modified-Since     Retrieve if account has changed since provided timestamp
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
====================  ===========================

|

======================  =========================
Request Parameter Name  Value
======================  =========================
limit                   The amount of results requested (default is 10000)
marker                  Return containers with name lexicographically after marker
format                  Optional extended reply type (can be ``json`` or ``xml``)
until                   Optional timestamp
======================  =========================

The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):

===========================  ============================
Name                         Description
===========================  ============================
name                         The name of the container
count                        The number of objects inside the container
bytes                        The total size of the objects inside the container
last_modified                The last container modification date (regardless of ``until``)
x_container_until_timestamp  The last container modification date until the timestamp provided
x_container_policy_*         Container behavior and limits
x_container_meta_*           Optional user defined metadata
===========================  ============================
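
The naming rule for extended replies (lower case, hyphens replaced with underscores) is mechanical. As a small illustration, with ``reply_key`` being a hypothetical helper name rather than anything defined by the API:

```python
def reply_key(header_name):
    """Map a header name to the key used in extended (json/xml) replies."""
    return header_name.lower().replace("-", "_")
```

A client can use the same mapping in reverse order of concern: look up ``reply_key("X-Container-Until-Timestamp")`` in each container entry instead of hard-coding the lower case form.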

    
For examples of container details returned in JSON/XML formats refer to the OOS API documentation.

===========================  =====================
Return Code                  Description
===========================  =====================
200 (OK)                     The request succeeded
204 (No Content)             The account has no containers (only for non-extended replies)
304 (Not Modified)           The account has not been modified
412 (Precondition Failed)    The condition set can not be satisfied
===========================  =====================

A ``200`` return code is always used if the reply is of type json/xml.
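
Listings larger than ``limit`` can be retrieved page by page, passing the last name of each page as the next ``marker``. The sketch below shows only the loop. ``fetch_page`` stands for whatever function performs the actual ``GET`` and returns a list of names; it is hypothetical, as is ``fake_fetch_page``, an in-memory stand-in so that the example runs without a server. Stopping on an empty page is an assumption about how a client would detect the end of the listing.

```python
def list_all(fetch_page, limit=10000):
    """Collect every name by following the marker from page to page."""
    names = []
    marker = None
    while True:
        page = fetch_page(marker=marker, limit=limit)
        if not page:
            # An empty page means nothing sorts after the current marker.
            return names
        names.extend(page)
        marker = page[-1]


def fake_fetch_page(marker=None, limit=10000):
    """In-memory stand-in for a listing request (illustration only)."""
    stored = ["apples", "bananas", "kiwis", "oranges", "pears"]
    after = [n for n in sorted(stored) if marker is None or n > marker]
    return after[:limit]
```

Calling ``list_all(fake_fetch_page, limit=2)`` walks three non-empty pages and one empty page before returning all five names.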

    
POST
""""

======================  ============================================
Request Parameter Name  Value
======================  ============================================
update                  Do not replace metadata/groups (no value parameter)
======================  ============================================

|

====================  ===========================
Request Header Name   Value
====================  ===========================
X-Account-Group-*     Optional user defined groups
X-Account-Meta-*      Optional user defined metadata
====================  ===========================

No reply content/headers.

The operation will overwrite all user defined metadata, unless ``update`` is given.
To create a group, include an ``X-Account-Group-*`` header with the group name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
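
The group rules above translate into request headers as follows. This is a minimal sketch: ``group_headers`` is a hypothetical helper, and the group and user names are made up for illustration.

```python
def group_headers(groups):
    """Turn {group name: [user, ...]} into X-Account-Group-* headers.

    An empty user list produces an empty header value, which deletes
    the group when the request also carries the update parameter.
    """
    headers = {}
    for name, users in groups.items():
        headers["X-Account-Group-" + name] = ",".join(users)
    return headers
```

Sending the result of ``group_headers({"dev": ["alice", "bob"], "old": []})`` together with ``update`` would set the ``dev`` group and delete the ``old`` one, leaving all other groups untouched.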

    
================  ===============================
Return Code       Description
================  ===============================
202 (Accepted)    The request has been accepted
================  ===============================

    
Container Level
^^^^^^^^^^^^^^^

List of operations:

=========  ============================
Operation  Description
=========  ============================
HEAD       Retrieve container metadata
GET        List objects
PUT        Create/update container
POST       Update container metadata
DELETE     Delete container
=========  ============================

    
HEAD
""""

======================  ===================================
Request Parameter Name  Value
======================  ===================================
until                   Optional timestamp
======================  ===================================

|

===========================  ===============================
Reply Header Name            Value
===========================  ===============================
X-Container-Object-Count     The total number of objects in the container
X-Container-Bytes-Used       The total number of bytes of all objects stored
X-Container-Block-Size       The block size used by the storage backend
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
X-Container-Object-Meta      A list with all meta keys used by objects
X-Container-Policy-*         Container behavior and limits
X-Container-Meta-*           Optional user defined metadata
Last-Modified                The last container modification date (regardless of ``until``)
===========================  ===============================

The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix. See container ``PUT`` for a reference of policy directives.

================  ===============================
Return Code       Description
================  ===============================
204 (No Content)  The request succeeded
================  ===============================

    
GET
"""

====================  ===========================
Request Header Name   Value
====================  ===========================
If-Modified-Since     Retrieve if container has changed since provided timestamp
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
====================  ===========================

|

======================  ===================================
Request Parameter Name  Value
======================  ===================================
limit                   The amount of results requested (default is 10000)
marker                  Return objects with name lexicographically after marker
prefix                  Return objects starting with prefix
delimiter               Return objects up to the delimiter (discussion follows)
path                    Assume ``prefix=path`` and ``delimiter=/``
format                  Optional extended reply type (can be ``json`` or ``xml``)
meta                    Return objects having the specified meta keys (can be a comma separated list)
until                   Optional timestamp
======================  ===================================

The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.

The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.

The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):

==========================  ======================================
Name                        Description
==========================  ======================================
name                        The name of the object
hash                        The ETag of the object
bytes                       The size of the object
content_type                The MIME content type of the object
content_encoding            The encoding of the object (optional)
content_disposition         The presentation style of the object (optional)
last_modified               The last object modification date (regardless of version)
x_object_version            The object's version identifier
x_object_version_timestamp  The object's version timestamp
x_object_modified_by        The user that committed the object's version
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
x_object_sharing            Object permissions (optional)
x_object_shared_by          Object inheriting permissions (optional)
x_object_public             Object's publicly accessible URI (optional)
x_object_meta_*             Optional user defined metadata
==========================  ======================================

    
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
In JSON results they appear as dictionaries with only a ``"subdir"`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
In case there is an object with the same name as a virtual directory marker, the object will be returned.
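
To make the interplay of ``prefix``, ``delimiter`` and virtual directory markers concrete, the following sketch simulates a listing over an in-memory list of names. It is an illustration of one reading of the rules above, not the server's implementation: ``list_with_delimiter`` is a hypothetical name, and searching for the delimiter only in the part of the name after ``prefix`` is an assumption based on the OOS behavior.

```python
def list_with_delimiter(names, prefix="", delimiter=None):
    """Split matching names into (objects, virtual directory markers)."""
    objects, subdirs = [], []
    for name in sorted(names):
        if not name.startswith(prefix):
            continue
        rest = name[len(prefix):]
        cut = rest.find(delimiter) if delimiter else -1
        if cut == -1:
            # No delimiter after the prefix: a plain object entry.
            objects.append(name)
            continue
        # Marker: everything up to and including the first delimiter.
        marker = prefix + rest[:cut + len(delimiter)]
        if marker == name:
            # An object named exactly like the marker is returned itself.
            objects.append(name)
        elif marker not in subdirs:
            subdirs.append(marker)
    # An existing object takes precedence over a marker of the same name.
    return objects, [s for s in subdirs if s not in objects]
```

With the names ``notes.txt``, ``photos/a.jpg``, ``photos/b.jpg``, ``music/`` and ``music/x.mp3`` and ``delimiter="/"``, the sketch returns the objects ``music/`` and ``notes.txt`` plus the single marker ``photos/``.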

    
For examples of object details returned in JSON/XML formats refer to the OOS API documentation.

===========================  ===============================
Return Code                  Description
===========================  ===============================
200 (OK)                     The request succeeded
204 (No Content)             The container has no objects (only for non-extended replies)
304 (Not Modified)           The container has not been modified
412 (Precondition Failed)    The condition set can not be satisfied
===========================  ===============================

A ``200`` return code is always used if the reply is of type json/xml.

    
PUT
"""

====================  ================================
Request Header Name   Value
====================  ================================
X-Container-Policy-*  Container behavior and limits
X-Container-Meta-*    Optional user defined metadata
====================  ================================

No reply content/headers.

If no policy is defined, the container will be created with the default values.
Available policy directives:

* ``versioning``: Set to ``auto``, ``manual`` or ``none`` (default is ``manual``)
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
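
A client can validate policy directives before sending them. The sketch below builds the corresponding request headers; ``policy_headers`` is a hypothetical helper, and the exact header spellings ``X-Container-Policy-Versioning`` and ``X-Container-Policy-Quota`` are assumptions derived from the ``X-Container-Policy-*`` pattern and the directive names.

```python
VERSIONING_VALUES = ("auto", "manual", "none")


def policy_headers(versioning=None, quota=None):
    """Build X-Container-Policy-* headers for the given directives.

    Directives left as None are omitted, so the server keeps (or, on
    creation, applies) its own values for them.
    """
    headers = {}
    if versioning is not None:
        if versioning not in VERSIONING_VALUES:
            raise ValueError("versioning must be one of %r" % (VERSIONING_VALUES,))
        headers["X-Container-Policy-Versioning"] = versioning
    if quota is not None:
        if quota < 0:
            raise ValueError("quota is a size limit in KB and cannot be negative")
        headers["X-Container-Policy-Quota"] = str(quota)
    return headers
```

For instance, ``policy_headers(versioning="auto", quota=1024)`` requests automatic versioning with a 1024 KB limit.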
 
================  ===============================
Return Code       Description
================  ===============================
201 (Created)     The container has been created
202 (Accepted)    The request has been accepted
================  ===============================

    
POST
""""

======================  ============================================
Request Parameter Name  Value
======================  ============================================
update                  Do not replace metadata/policy (no value parameter)
======================  ============================================

|

====================  ================================
Request Header Name   Value
====================  ================================
X-Container-Policy-*  Container behavior and limits
X-Container-Meta-*    Optional user defined metadata
====================  ================================

No reply content/headers.

The operation will overwrite all user defined metadata, unless ``update`` is given.
To change policy, include an ``X-Container-Policy-*`` header with the directive name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy; deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.

================  ===============================
Return Code       Description
================  ===============================
202 (Accepted)    The request has been accepted
================  ===============================

    
DELETE
""""""

No request parameters/headers.

No reply content/headers.

================  ===============================
Return Code       Description
================  ===============================
204 (No Content)  The request succeeded
409 (Conflict)    The container is not empty
================  ===============================

    
Object Level
^^^^^^^^^^^^

List of operations:

=========  =================================
Operation  Description
=========  =================================
HEAD       Retrieve object metadata
GET        Read object data
PUT        Write object data or copy/move object
COPY       Copy object
MOVE       Move object
POST       Update object metadata/data
DELETE     Delete object
=========  =================================

    
HEAD
""""

======================  ===================================
Request Parameter Name  Value
======================  ===================================
version                 Optional version identifier
======================  ===================================

|

==========================  ===============================
Reply Header Name           Value
==========================  ===============================
ETag                        The ETag of the object
Content-Length              The size of the object
Content-Type                The MIME content type of the object
Last-Modified               The last object modification date (regardless of version)
Content-Encoding            The encoding of the object (optional)
Content-Disposition         The presentation style of the object (optional)
X-Object-Version            The object's version identifier
X-Object-Version-Timestamp  The object's version timestamp
X-Object-Modified-By        The user that committed the object's version
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
X-Object-Sharing            Object permissions (optional)
X-Object-Shared-By          Object inheriting permissions (optional)
X-Object-Public             Object's publicly accessible URI (optional)
X-Object-Meta-*             Optional user defined metadata
==========================  ===============================

|

================  ===============================
Return Code       Description
================  ===============================
200 (OK)          The request succeeded
================  ===============================

    
GET
"""

====================  ================================
Request Header Name   Value
====================  ================================
Range                 Optional range of data to retrieve
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
If-Match              Retrieve if ETags match
If-None-Match         Retrieve if ETags don't match
If-Modified-Since     Retrieve if object has changed since provided timestamp
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
====================  ================================

|

======================  ===================================
Request Parameter Name  Value
======================  ===================================
format                  Optional extended reply type (can be ``json`` or ``xml``)
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
======================  ===================================

The reply is the object's data (or part of it), except if a hashmap is requested with the ``format`` parameter, or a version list with ``version=list`` (in which case an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.

Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block.

    
Example ``format=json`` reply:

::

  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}

Example ``format=xml`` reply:

::

  <?xml version="1.0" encoding="UTF-8"?>
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
    <hash>...</hash>
  </object>
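
A client that wants to compare local data against a hashmap reply can compute the same structure itself. The following is a minimal sketch under stated assumptions: ``object_hashmap`` is a hypothetical helper, the block size and hash algorithm default to the values shown in the examples (in practice they would come from ``X-Container-Block-Size`` and ``X-Container-Block-Hash``), and the block hash is taken to be the plain hex digest of the trimmed block.

```python
import hashlib


def object_hashmap(data, block_size=131072, block_hash="sha1"):
    """Compute a hashmap dictionary for a bytes object."""
    hashes = []
    for start in range(0, len(data), block_size):
        # Trailing null bytes are trimmed before hashing each block.
        block = data[start:start + block_size].rstrip(b"\x00")
        hashes.append(hashlib.new(block_hash, block).hexdigest())
    return {
        "block_hash": block_hash,
        "hashes": hashes,
        "block_size": block_size,
        "bytes": len(data),
    }
```

Because of the trimming, a block that consists of ``a`` followed by null padding hashes to the same value as the single byte ``a``. How an empty object is represented is not covered here; the sketch simply returns an empty ``hashes`` list.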

    
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.

Example ``format=json`` reply:

::

  {"versions": [[23, 1307700892], [28, 1307700898], ...]}

Example ``format=xml`` reply:

::

  <?xml version="1.0" encoding="UTF-8"?>
  <object name="file">
    <version timestamp="1307700892">23</version>
    <version timestamp="1307700898">28</version>
    <version timestamp="...">...</version>
  </object>
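
Since identifiers carry no ordering, picking the newest version means comparing timestamps. A short sketch over the JSON reply format shown above, with ``latest_version`` as a hypothetical helper name:

```python
import json


def latest_version(reply_body):
    """Return the identifier of the version with the newest timestamp.

    Returns None if the version list is empty.
    """
    versions = json.loads(reply_body)["versions"]
    if not versions:
        return None
    # Each entry is an [identifier, timestamp] pair; compare timestamps only.
    version_id, _timestamp = max(versions, key=lambda pair: pair[1])
    return version_id
```

The identifier returned can then be passed as the ``version`` parameter of a ``HEAD`` or ``GET`` request.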

    
The ``Range`` header may include multiple ranges, as outlined in RFC 2616. In that case, the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
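
A multi-range request only differs from a single-range one in the value of the ``Range`` header. As an illustration of the RFC 2616 byte-range syntax, the hypothetical helper ``range_header`` below formats a list of ranges; ``None`` marks an open end.

```python
def range_header(ranges):
    """Format (first, last) byte positions as a Range header value.

    (first, None) means "from first to the end of the object";
    (None, n) means "the last n bytes of the object".
    """
    specs = []
    for first, last in ranges:
        if first is None:
            specs.append("-%d" % last)
        elif last is None:
            specs.append("%d-" % first)
        else:
            specs.append("%d-%d" % (first, last))
    return "bytes=" + ",".join(specs)
```

A single pair produces an ordinary range request, for which the reply carries one ``Content-Range`` header instead of a multipart body.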

    
==========================  ===============================
Reply Header Name           Value
==========================  ===============================
ETag                        The ETag of the object
Content-Length              The size of the data returned
Content-Type                The MIME content type of the object
Content-Range               The range of data included (only on a single range request)
Last-Modified               The last object modification date (regardless of version)
Content-Encoding            The encoding of the object (optional)
Content-Disposition         The presentation style of the object (optional)
X-Object-Version            The object's version identifier
X-Object-Version-Timestamp  The object's version timestamp
X-Object-Modified-By        The user that committed the object's version
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
X-Object-Sharing            Object permissions (optional)
X-Object-Shared-By          Object inheriting permissions (optional)
X-Object-Public             Object's publicly accessible URI (optional)
X-Object-Meta-*             Optional user defined metadata
==========================  ===============================

|

===========================  ==============================
Return Code                  Description
===========================  ==============================
200 (OK)                     The request succeeded
206 (Partial Content)        The range request succeeded
304 (Not Modified)           The object has not been modified
412 (Precondition Failed)    The condition set can not be satisfied
416 (Range Not Satisfiable)  The requested range is out of limits
===========================  ==============================

    
PUT
"""

====================  ================================
Request Header Name   Value
====================  ================================
ETag                  The MD5 hash of the object (optional to check written data)
Content-Length        The size of the data written
Content-Type          The MIME content type of the object
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
X-Copy-From           The source path in the form ``/<container>/<object>``
X-Move-From           The source path in the form ``/<container>/<object>``
X-Source-Version      The source version to copy from
Content-Encoding      The encoding of the object (optional)
Content-Disposition   The presentation style of the object (optional)
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
X-Object-Sharing      Object permissions (optional)
X-Object-Public       Object is publicly accessible (optional)
X-Object-Meta-*       Optional user defined metadata
====================  ================================

|

======================  ===================================
Request Parameter Name  Value
======================  ===================================
format                  Optional extended request type (can be ``json``) to create the object by supplying its hashmap instead
======================  ===================================

    
The request is the object's data (or part of it), except if a hashmap is provided with the ``format`` parameter. If ``format`` is used and all the different parts are already stored on the server, the object is created; otherwise the server returns ``409`` (Conflict) with the list of the missing parts.

Hashmaps expose the underlying storage format of the object.

Example ``format=json`` request:

::

  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
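
When a hashmap ``PUT`` is answered with ``409``, the client has to work out which parts of the local data the server still needs. The sketch below covers only that step. It assumes that the reply body is a JSON array of the missing hash strings, which this guide does not spell out, and ``missing_block_offsets`` is a hypothetical helper name.

```python
import json


def missing_block_offsets(hashmap, conflict_body):
    """Return the byte offsets of the blocks the server reported missing.

    hashmap is the dictionary that was sent in the request;
    conflict_body is the body of the 409 reply, assumed here to be a
    JSON array of hash strings.
    """
    missing = set(json.loads(conflict_body))
    size = hashmap["block_size"]
    return [
        index * size
        for index, block_hash in enumerate(hashmap["hashes"])
        if block_hash in missing
    ]
```

Each offset, together with ``block_size``, identifies a slice of the local data that must reach the server before the hashmap request is retried.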

    
==========================  ===============================
Reply Header Name           Value
==========================  ===============================
ETag                        The MD5 hash of the object (on create)
==========================  ===============================

    
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
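
The ``X-Object-Sharing`` value can be assembled from two lists. A minimal sketch, in which ``sharing_header`` is a hypothetical helper and the user and group names in the usage note are made up:

```python
def sharing_header(read=(), write=()):
    """Build an X-Object-Sharing value from read and write lists.

    Each list holds user or group names; an empty list is left out.
    """
    parts = []
    if read:
        parts.append("read=" + ",".join(read))
    if write:
        parts.append("write=" + ",".join(write))
    return ";".join(parts)
```

For example, ``sharing_header(read=["alice", "dev"], write=["bob"])`` gives ``read=alice,dev;write=bob``.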

    
===========================  ==============================
Return Code                  Description
===========================  ==============================
201 (Created)                The object has been created
409 (Conflict)               The object cannot be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a conflicting sharing path will be included in the reply - in JSON format)
411 (Length Required)        Missing ``Content-Length`` or ``Content-Type`` in the request
422 (Unprocessable Entity)   The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
409 (Conflict)               Conflict with the current state of the resource
===========================  ==============================
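When a hashmap ``PUT`` is answered with Conflict (409), the client has to work out which of its local blocks the server is missing. The Python sketch below makes two assumptions that are not stated in this guide: the reply body is a plain JSON array of hex digests, and blocks are hashed over their raw bytes. ``blocks_to_upload`` is a hypothetical helper name, and how the missing blocks are then sent to the server is outside the scope of the sketch.

```python
import hashlib
import json


def blocks_to_upload(data, conflict_body, block_size=131072, block_hash="sha1"):
    """Return (offset, block) pairs for the blocks named in a 409 reply.

    conflict_body is assumed to be a JSON array of missing block hashes.
    """
    missing = set(json.loads(conflict_body))
    blocks = []
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        if hashlib.new(block_hash, block).hexdigest() in missing:
            blocks.append((offset, block))
    return blocks
```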

    
COPY
""""

====================  ================================
Request Header Name   Value
====================  ================================
Destination           The destination path in the form ``/<container>/<object>``
Content-Type          The MIME content type of the object (optional)
Content-Encoding      The encoding of the object (optional)
Content-Disposition   The presentation style of the object (optional)
X-Source-Version      The source version to copy from
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
X-Object-Sharing      Object permissions (optional)
X-Object-Public       Object is publicly accessible (optional)
X-Object-Meta-*       Optional user defined metadata
====================  ================================

Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, and updated with any values defined in the request headers. Sharing/publishing options are not copied.

No reply content/headers.

===========================  ==============================
Return Code                  Description
===========================  ==============================
201 (Created)                The object has been created
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
===========================  ==============================

    
MOVE
""""

Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.

    
POST
""""

======================  ============================================
Request Parameter Name  Value
======================  ============================================
update                  Do not replace metadata (no value parameter)
======================  ============================================

|

====================  ================================
Request Header Name   Value
====================  ================================
Content-Length        The size of the data written (optional, to update)
Content-Type          The MIME content type of the object (optional, to update)
Content-Range         The range of data supplied (optional, to update)
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
Content-Encoding      The encoding of the object (optional)
Content-Disposition   The presentation style of the object (optional)
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
X-Object-Sharing      Object permissions (optional)
X-Object-Public       Object is publicly accessible (optional)
X-Object-Meta-*       Optional user defined metadata
====================  ================================

    
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update``, previous values are kept unless redefined, and any metadata supplied with an empty value will be deleted.
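The two metadata modes can be modelled as a small pure function. This Python sketch only illustrates the semantics described above, it is not server code: ``apply_meta`` is a hypothetical name, and the handling of empty values without ``update`` is left as supplied because the text does not cover that case.

```python
def apply_meta(current, supplied, update=False):
    """Return the metadata that would result from a POST (assumed semantics)."""
    if not update:
        # Replace mode: previous values are overwritten and keys that are
        # not supplied again are removed.
        return dict(supplied)
    merged = dict(current)
    for key, value in supplied.items():
        if value == "":
            merged.pop(key, None)  # an empty value deletes the key
        else:
            merged[key] = value
    return merged
```

With ``update=True`` a client can change or delete a single key without resending the rest, while the default mode requires the complete set of metadata in every request.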

    
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.

    
To update an object's data:

* Set ``Content-Type`` to ``application/octet-stream``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
* Supply ``Content-Length`` (except if using chunked transfers), ``Content-Type`` and ``Content-Range`` headers.
* Set ``Content-Range`` as specified in RFC2616, with the following differences:

  * Client software MAY omit ``last-byte-pos`` if the length of the range being transferred is unknown or difficult to determine.
  * Client software SHOULD NOT specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.

* If the ``Content-Range`` used has a ``byte-range-resp-spec = *``, data supplied will be appended to the object.
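The ``Content-Range`` variants listed above can be generated by a single helper. The Python sketch below is a hedged illustration: ``content_range`` is a hypothetical function name, and the exact spelling of a range without ``last-byte-pos`` (``bytes 10-/*``) is an assumption derived from the RFC2616 grammar rather than something this guide spells out.

```python
def content_range(first=None, last=None, length=None):
    """Build a Content-Range value for a Pithos data update.

    first=None means append ("bytes */*"); last=None omits last-byte-pos;
    length=None sends "*" so that no size check is requested.
    """
    total = "*" if length is None else str(length)
    if first is None:
        if last is not None:
            raise ValueError("last given without first")
        return "bytes */%s" % total
    if first < 0 or (last is not None and last < first):
        raise ValueError("invalid byte range")
    spec = "%d-%s" % (first, "" if last is None else last)
    return "bytes %s/%s" % (spec, total)
```

``content_range(10, 19)`` gives ``bytes 10-19/*`` for an in-place update, and ``content_range()`` gives ``bytes */*`` for an append.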

    
A data update will trigger an ETag change. The new ETag will not correspond to the object's MD5 sum (**TBD**) and will be included in reply headers.

No reply content. No reply headers if only metadata is updated.

==========================  ===============================
Reply Header Name           Value
==========================  ===============================
ETag                        The new ETag of the object (data updated)
==========================  ===============================

|

===========================  ==============================
Return Code                  Description
===========================  ==============================
202 (Accepted)               The request has been accepted (not a data update)
204 (No Content)             The request succeeded (data updated)
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
411 (Length Required)        Missing ``Content-Length`` in the request
416 (Range Not Satisfiable)  The supplied range is invalid
===========================  ==============================

    
DELETE
""""""

No request parameters/headers.

No reply content/headers.

===========================  ==============================
Return Code                  Description
===========================  ==============================
204 (No Content)             The request succeeded
===========================  ==============================

    
Sharing and Public Objects
^^^^^^^^^^^^^^^^^^^^^^^^^^

Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail if the object is already "covered", or if another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
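The inheritance and no-nesting rules can be sketched as follows. This Python fragment is an interpretation, not the server's algorithm: it assumes that "prefix" means a plain string prefix of the object name within one container, and ``sharing_source`` and ``can_set_sharing`` are hypothetical names.

```python
def sharing_source(name, shared):
    """Return the object name whose permissions cover ``name``, or None.

    ``shared`` holds the names of objects carrying X-Object-Sharing.
    Because nesting is not allowed, at most one of them can match.
    """
    for candidate in shared:
        if name.startswith(candidate):
            return candidate
    return None


def can_set_sharing(name, shared):
    """Check the no-nesting rule before setting X-Object-Sharing on ``name``."""
    others = set(shared) - {name}
    covered = any(name.startswith(other) for other in others)
    covers_other = any(other.startswith(name) for other in others)
    return not (covered or covers_other)
```

Under these assumptions, with permissions set on ``photos/2011``, the object ``photos/2011/a.jpg`` inherits them, and an attempt to set permissions on either ``photos/2011/a.jpg`` or ``photos`` would be rejected.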

    
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden):

==========================  ===============================
Reply Header Name           Value
==========================  ===============================
ETag                        The ETag of the object
Content-Length              The size of the data returned
Content-Type                The MIME content type of the object
Content-Range               The range of data included (only on a single range request)
Last-Modified               The last object modification date (regardless of version)
Content-Encoding            The encoding of the object (optional)
Content-Disposition         The presentation style of the object (optional)
==========================  ===============================

    
Summary
^^^^^^^

List of differences from the OOS API:

* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings.
* Container policies to manage behavior and limits.
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
* All metadata replies, at all levels, include latest modification information.
* At all levels, a ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
* Container/object lists include all associated metadata if the reply is of type json/xml. Some names are kept to their OOS API equivalents for compatibility.
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
* Multi-range object GET support as outlined in RFC2616.
* Object hashmap retrieval through GET and the ``format`` parameter.
* Partial object updates through POST, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers.
* Object ``MOVE`` support.
* Time-variant account/container listings via the ``until`` parameter.
* Object versions - parameter ``version`` in HEAD/GET (list versions with GET), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in PUT/COPY.
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
* Large object support with ``X-Object-Manifest``.
* Trace the user that created/modified an object with ``X-Object-Modified-By``.

    
Clarifications/suggestions:

* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired reply format. The parameter overrides the header.
* Container/object lists use a ``200`` return code if the reply is of type json/xml. The reply will include an empty json/xml.
* In headers, dates are formatted according to RFC 1123. In extended information listings, dates are formatted according to ISO 8601.
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``ETag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
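Two of the points above lend themselves to a short illustration: the normalization of the variable part of ``X-*-Meta-*`` headers, and the ETag of an ``X-Object-Manifest`` object. The Python sketch below rests on assumptions that go beyond the text: letters other than the first of each word are left untouched, and the parts' ETags are concatenated as hex strings in listing order before hashing. Both function names are hypothetical.

```python
import hashlib


def normalize_meta_suffix(suffix):
    """Normalize the variable part of an X-*-Meta-* header name."""
    words = suffix.replace("_", "-").split("-")
    # Capitalize only the first letter of each dash-separated word.
    return "-".join(word[:1].upper() + word[1:] for word in words)


def manifest_etag(part_etags):
    """MD5 of the parts' ETags concatenated (assumed: hex strings, in order)."""
    return hashlib.md5("".join(part_etags).encode("ascii")).hexdigest()
```

For instance, ``normalize_meta_suffix("my_custom_key")`` returns ``My-Custom-Key``, so a client should expect to read back ``X-Object-Meta-My-Custom-Key`` after setting ``X-Object-Meta-my_custom_key``.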

    
The Pithos Client
-----------------

User Experience
^^^^^^^^^^^^^^^

Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.

    
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:

* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
* The ``others`` element, which contains all objects that other users share with the user.
* The ``tags`` element, which lists the names of tags the user has defined. This can be an entry point to list all files that have been assigned a specific tag or manage tags in general (remove a tag completely, rename a tag etc.).
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.

    
Objects in Pithos can be:

* Assigned custom tags.
* Moved to trash and then deleted.
* Shared with specific permissions.
* Made public (shared with non-Pithos users).
* Restored from previous versions.

Some of these functions are performed by the client software and some by the Pithos server. Client-driven functionality is based on specific metadata that should be handled equally across implementations. These metadata names are discussed in the next section.

    
Conventions and Metadata Specification
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Pithos clients should use the ``pithos`` container for all Pithos objects. Object names use the ``/`` delimiter to impose a hierarchy of folders and files.

At the object level, tags are implemented by managing metadata keys. The client software should allow the user to use any string as a tag (except ``trash``) and then set the corresponding ``X-Object-Meta-<tag>`` key at the server. The API extensions provided allow for listing all tags in a container and filtering object listings based on one or more tags. The tag list is sufficient for implementing the ``tags`` element, either as a special, virtual folder (as done in the first version of Pithos), or as an application menu.

To manage the deletion of files use the same API and the ``X-Object-Meta-Trash`` key. The string ``trash`` cannot be used as a tag. The ``trash`` element should be presented as a folder, although with no hierarchy.

The metadata specification is summarized in the following table.

    
===========================  ==============================
Metadata Name                Value
===========================  ==============================
X-Object-Meta-Trash          Set to ``true`` if the object has been moved to the trash
X-Object-Meta-*              Use for other tags that apply to the object
===========================  ==============================
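A client might translate user tags into request headers roughly as shown below. This Python sketch is illustrative: the helper names are hypothetical, and using ``true`` as the value of ordinary tag keys is an assumption, since the table above only fixes the value for ``X-Object-Meta-Trash``.

```python
def tag_headers(tags):
    """Map user tags to X-Object-Meta-<tag> headers (value assumed 'true')."""
    headers = {}
    for tag in tags:
        if tag.lower() == "trash":
            # ``trash`` is reserved for the trash element.
            raise ValueError("'trash' cannot be used as a tag")
        headers["X-Object-Meta-" + tag] = "true"
    return headers


def trash_headers():
    """Headers that mark an object as moved to the trash."""
    return {"X-Object-Meta-Trash": "true"}
```

Sending these headers in a ``POST`` with the ``update`` parameter would add the tags without disturbing other metadata of the object.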

    
Recommended Practices and Examples
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Assuming an authentication token is obtained (**TBD**), the following high-level operations are available - shown with ``curl``:

    
* Get account information ::

    curl -I \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user

    
* List available containers ::

    curl -X GET -D - \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user

    
* Get container information ::

    curl -I \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user/pithos

    
* Add a new container ::

    curl -X PUT -D - \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user/test

* Delete a container ::

    curl -X DELETE -D - \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user/test

* List objects in a container ::

    curl -X GET -D - \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user/pithos

    
* List objects in a container (extended reply) ::

    curl -X GET -D - \
         -H "X-Auth-Token: 0000" \
         "https://pithos.dev.grnet.gr/v1/user/pithos?format=json"

  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.

    
* List metadata keys used by objects in a container

  The keys are reported in the ``X-Container-Object-Meta`` reply header, included in container information or object list replies (``HEAD`` or ``GET``).

    
* List objects in a container having a specific meta defined ::

    curl -X GET -D - \
         -H "X-Auth-Token: 0000" \
         "https://pithos.dev.grnet.gr/v1/user/pithos?meta=trash"

  This is the recommended way of tagging/retrieving objects in trash.

    
* Retrieve an object ::

    curl -X GET -D - \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt

    
* Retrieve an object (specific ranges of data) ::

    curl -X GET -D - \
         -H "X-Auth-Token: 0000" \
         -H "Range: bytes=0-9" \
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt

  This will return the first 10 bytes. To get the first 10 bytes, bytes 30-39 and the last 100 bytes, use ``Range: bytes=0-9,30-39,-100``.

    
* Add a new object (folder type) (**TBD**) ::

    curl -X PUT -D - \
         -H "X-Auth-Token: 0000" \
         -H "Content-Type: application/folder" \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder

    
* Add a new object ::

    curl -X PUT -D - \
         -H "X-Auth-Token: 0000" \
         -H "Content-Type: text/plain" \
         -T EXAMPLE.txt \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt

    
* Update an object ::

    curl -X POST -D - \
         -H "X-Auth-Token: 0000" \
         -H "Content-Length: 10" \
         -H "Content-Type: application/octet-stream" \
         -H "Content-Range: bytes 10-19/*" \
         -d "0123456789" \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt

  This will update bytes 10-19 with the data specified.

    
* Update an object (append) ::

    curl -X POST -D - \
         -H "X-Auth-Token: 0000" \
         -H "Content-Length: 10" \
         -H "Content-Type: application/octet-stream" \
         -H "Content-Range: bytes */*" \
         -d "0123456789" \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt

    
* Add object metadata ::

    curl -X POST -D - \
         -H "X-Auth-Token: 0000" \
         -H "X-Object-Meta-First: first_meta_value" \
         -H "X-Object-Meta-Second: second_meta_value" \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt

    
* Delete object metadata ::

    curl -X POST -D - \
         -H "X-Auth-Token: 0000" \
         -H "X-Object-Meta-First: first_meta_value" \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt

  Without the ``update`` parameter, metadata can only be "set": the request replaces all previous values, so ``X-Object-Meta-Second`` is deleted because it is not supplied again.

    
* Delete an object ::

    curl -X DELETE -D - \
         -H "X-Auth-Token: 0000" \
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
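The same operations can be scripted without ``curl``. The following Python sketch prepares, but does not send, a request using only the standard library. ``build_request`` is a hypothetical helper, and the account URL and the ``0000`` token are the placeholders used in the examples above.

```python
import urllib.request

BASE_URL = "https://pithos.dev.grnet.gr/v1/user"  # account URL from the examples


def build_request(method, path="", token="0000", headers=None, data=None):
    """Prepare (but do not send) a request equivalent to the curl calls."""
    all_headers = {"X-Auth-Token": token}
    all_headers.update(headers or {})
    return urllib.request.Request(
        BASE_URL + path, data=data, headers=all_headers, method=method
    )


# Overwrite bytes 10-19 of the object uploaded in the "Add a new object" example.
update = build_request(
    "POST",
    "/pithos/folder/EXAMPLE.txt",
    headers={
        "Content-Type": "application/octet-stream",
        "Content-Range": "bytes 10-19/*",
    },
    data=b"0123456789",
)
# Passing ``update`` to urllib.request.urlopen() would send it; not done here.
```

Keeping request construction separate from sending makes it easy to inspect or log the headers before anything reaches the server.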