Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ ab2e317e

History | View | Annotate | Download (47.5 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
Document Revisions
23
^^^^^^^^^^^^^^^^^^
24

    
25
=========================  ================================
26
Revision                   Description
27
=========================  ================================
28
0.5 (July 11, 2011)        Object update from another object's data.
29
0.4 (July 01, 2011)        Object permissions and account groups.
30
\                          Control versioning behavior and container quotas with container policy directives.
31
\                          Support updating/deleting individual metadata with ``POST``.
32
\                          Create object using hashmap.
33
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
34
\                          Allow for publicly available objects via ``https://hostname/public``.
35
\                          Support time-variant account/container listings. 
36
\                          Add source version when duplicating with PUT/COPY.
37
\                          Request version in object HEAD/GET requests (list versions with GET).
38
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
39
\                          Include underlying storage characteristics in container meta.
40
\                          Support for partial object updates through POST.
41
\                          Expose object hashmaps through GET.
42
\                          Support for multi-range object GET requests.
43
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
44
=========================  ================================
45

    
46
The Pithos API
47
--------------
48

    
49
The URI requests supported by the Pithos API follow one of the following forms:
50

    
51
* Top level: ``https://hostname/v1/``
52
* Account level: ``https://hostname/v1/<account>``
53
* Container level: ``https://hostname/v1/<account>/<container>``
54
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
55

    
56
All requests must include an ``X-Auth-Token``. The process of obtaining the token is still to be determined (**TBD**).
57

    
58
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
59

    
60
=========================  ================================
61
Return Code                Description
62
=========================  ================================
63
400 (Bad Request)          The request is invalid
64
401 (Unauthorized)         Request not allowed
65
404 (Not Found)            The requested resource was not found
66
503 (Service Unavailable)  The request cannot be completed because of an internal error
67
=========================  ================================
68

    
69
Top Level
70
^^^^^^^^^
71

    
72
List of operations:
73

    
74
=========  ==================
75
Operation  Description
76
=========  ==================
77
GET        Authentication. This is kept for compatibility with the OOS API
78
=========  ==================
79

    
80
GET
81
"""
82

    
83
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
84

    
85
================  =====================
86
Return Code       Description
87
================  =====================
88
204 (No Content)  The request succeeded
89
================  =====================
90

    
91

    
92
Account Level
93
^^^^^^^^^^^^^
94

    
95
List of operations:
96

    
97
=========  ==================
98
Operation  Description
99
=========  ==================
100
HEAD       Retrieve account metadata
101
GET        List containers
102
POST       Update account metadata
103
=========  ==================
104

    
105
HEAD
106
""""
107

    
108
======================  ===================================
109
Request Parameter Name  Value
110
======================  ===================================
111
until                   Optional timestamp
112
======================  ===================================
113

    
114
|
115

    
116
==========================  =====================
117
Reply Header Name           Value
118
==========================  =====================
119
X-Account-Container-Count   The total number of containers
120
X-Account-Object-Count      The total number of objects (**TBD**)
121
X-Account-Bytes-Used        The total number of bytes stored
122
X-Account-Bytes-Remaining   The total number of bytes remaining (**TBD**)
123
X-Account-Last-Login        The last login (**TBD**)
124
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
125
X-Account-Group-*           Optional user defined groups
126
X-Account-Meta-*            Optional user defined metadata
127
Last-Modified               The last account modification date (regardless of ``until``)
128
==========================  =====================
129

    
130
|
131

    
132
================  =====================
133
Return Code       Description
134
================  =====================
135
204 (No Content)  The request succeeded
136
================  =====================
137

    
138

    
139
GET
140
"""
141

    
142
====================  ===========================
143
Request Header Name   Value
144
====================  ===========================
145
If-Modified-Since     Retrieve if account has changed since provided timestamp
146
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
147
====================  ===========================
148

    
149
|
150

    
151
======================  =========================
152
Request Parameter Name  Value
153
======================  =========================
154
limit                   The amount of results requested (default is 10000)
155
marker                  Return containers with name lexicographically after marker
156
format                  Optional extended reply type (can be ``json`` or ``xml``)
157
until                   Optional timestamp
158
======================  =========================
159

    
160
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
161
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
162
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
163

    
164
===========================  ============================
165
Name                         Description
166
===========================  ============================
167
name                         The name of the container
168
count                        The number of objects inside the container
169
bytes                        The total size of the objects inside the container
170
last_modified                The last container modification date (regardless of ``until``)
171
x_container_until_timestamp  The last container modification date until the timestamp provided
172
x_container_policy_*         Container behavior and limits
173
x_container_meta_*           Optional user defined metadata
174
===========================  ============================
175

    
176
For examples of container details returned in JSON/XML formats refer to the OOS API documentation.
177

    
178
===========================  =====================
179
Return Code                  Description
180
===========================  =====================
181
200 (OK)                     The request succeeded
182
204 (No Content)             The account has no containers (only for non-extended replies)
183
304 (Not Modified)           The account has not been modified
184
412 (Precondition Failed)    The condition set can not be satisfied
185
===========================  =====================
186

    
187
Will use a ``200`` return code if the reply is of type json/xml.
188

    
189

    
190
POST
191
""""
192

    
193
======================  ============================================
194
Request Parameter Name  Value
195
======================  ============================================
196
update                  Do not replace metadata/groups (no value parameter)
197
======================  ============================================
198

    
199
|
200

    
201
====================  ===========================
202
Request Header Name   Value
203
====================  ===========================
204
X-Account-Group-*     Optional user defined groups
205
X-Account-Meta-*      Optional user defined metadata
206
====================  ===========================
207

    
208
No reply content/headers.
209

    
210
The operation will overwrite all user defined metadata, except if ``update`` is defined.
211
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
212

    
213
================  ===============================
214
Return Code       Description
215
================  ===============================
216
202 (Accepted)    The request has been accepted
217
================  ===============================
218

    
219

    
220
Container Level
221
^^^^^^^^^^^^^^^
222

    
223
List of operations:
224

    
225
=========  ============================
226
Operation  Description
227
=========  ============================
228
HEAD       Retrieve container metadata
229
GET        List objects
230
PUT        Create/update container
231
POST       Update container metadata
232
DELETE     Delete container
233
=========  ============================
234

    
235

    
236
HEAD
237
""""
238

    
239
======================  ===================================
240
Request Parameter Name  Value
241
======================  ===================================
242
until                   Optional timestamp
243
======================  ===================================
244

    
245
|
246

    
247
===========================  ===============================
248
Reply Header Name            Value
249
===========================  ===============================
250
X-Container-Object-Count     The total number of objects in the container
251
X-Container-Bytes-Used       The total number of bytes of all objects stored
252
X-Container-Block-Size       The block size used by the storage backend
253
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
254
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
255
X-Container-Object-Meta      A list with all meta keys used by objects
256
X-Container-Policy-*         Container behavior and limits
257
X-Container-Meta-*           Optional user defined metadata
258
Last-Modified                The last container modification date (regardless of ``until``)
259
===========================  ===============================
260

    
261
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix. See container ``PUT`` for a reference of policy directives.
262

    
263
================  ===============================
264
Return Code       Description
265
================  ===============================
266
204 (No Content)  The request succeeded
267
================  ===============================
268

    
269

    
270
GET
271
"""
272

    
273
====================  ===========================
274
Request Header Name   Value
275
====================  ===========================
276
If-Modified-Since     Retrieve if container has changed since provided timestamp
277
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
278
====================  ===========================
279

    
280
|
281

    
282
======================  ===================================
283
Request Parameter Name  Value
284
======================  ===================================
285
limit                   The amount of results requested (default is 10000)
286
marker                  Return containers with name lexicographically after marker
287
prefix                  Return objects starting with prefix
288
delimiter               Return objects up to the delimiter (discussion follows)
289
path                    Assume ``prefix=path`` and ``delimiter=/``
290
format                  Optional extended reply type (can be ``json`` or ``xml``)
291
meta                    Return objects having the specified meta keys (can be a comma separated list)
292
until                   Optional timestamp
293
======================  ===================================
294

    
295
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
296

    
297
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
298

    
299
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
300
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
301
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
302

    
303
==========================  ======================================
304
Name                        Description
305
==========================  ======================================
306
name                        The name of the object
307
hash                        The ETag of the object
308
bytes                       The size of the object
309
content_type                The MIME content type of the object
310
content_encoding            The encoding of the object (optional)
311
content-disposition         The presentation style of the object (optional)
312
last_modified               The last object modification date (regardless of version)
313
x_object_version            The object's version identifier
314
x_object_version_timestamp  The object's version timestamp
315
x_object_modified_by        The user that committed the object's version
316
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
317
x_object_sharing            Object permissions (optional)
318
x_object_shared_by          Object inheriting permissions (optional)
319
x_object_public             Object's publicly accessible URI (optional)
320
x_object_meta_*             Optional user defined metadata
321
==========================  ======================================
322

    
323
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
324
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
325
In JSON results they appear as dictionaries with only a ``"subdir"`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
326
In case there is an object with the same name as a virtual directory marker, the object will be returned.
327

    
328
For examples of object details returned in JSON/XML formats refer to the OOS API documentation.
329

    
330
===========================  ===============================
331
Return Code                  Description
332
===========================  ===============================
333
200 (OK)                     The request succeeded
334
204 (No Content)             The account has no containers (only for non-extended replies)
335
304 (Not Modified)           The container has not been modified
336
412 (Precondition Failed)    The condition set can not be satisfied
337
===========================  ===============================
338

    
339
Will use a ``200`` return code if the reply is of type json/xml.
340

    
341

    
342
PUT
343
"""
344

    
345
====================  ================================
346
Request Header Name   Value
347
====================  ================================
348
X-Container-Policy-*  Container behavior and limits
349
X-Container-Meta-*    Optional user defined metadata
350
====================  ================================
351
 
352
No reply content/headers.
353

    
354
If no policy is defined, the container will be created with the default values.
355
Available policy directives:
356

    
357
* ``versioning``: Set to ``auto``, ``manual`` or ``none`` (default is ``manual``)
358
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
359
 
360
================  ===============================
361
Return Code       Description
362
================  ===============================
363
201 (Created)     The container has been created
364
202 (Accepted)    The request has been accepted
365
================  ===============================
366

    
367

    
368
POST
369
""""
370

    
371
======================  ============================================
372
Request Parameter Name  Value
373
======================  ============================================
374
update                  Do not replace metadata/policy (no value parameter)
375
======================  ============================================
376

    
377
|
378

    
379
====================  ================================
380
Request Header Name   Value
381
====================  ================================
382
X-Container-Policy-*  Container behavior and limits
383
X-Container-Meta-*    Optional user defined metadata
384
====================  ================================
385

    
386
No reply content/headers.
387

    
388
The operation will overwrite all user defined metadata, except if ``update`` is defined.
389
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
390

    
391
================  ===============================
392
Return Code       Description
393
================  ===============================
394
202 (Accepted)    The request has been accepted
395
================  ===============================
396

    
397

    
398
DELETE
399
""""""
400

    
401
No request parameters/headers.
402

    
403
No reply content/headers.
404

    
405
================  ===============================
406
Return Code       Description
407
================  ===============================
408
204 (No Content)  The request succeeded
409
409 (Conflict)    The container is not empty
410
================  ===============================
411

    
412

    
413
Object Level
414
^^^^^^^^^^^^
415

    
416
List of operations:
417

    
418
=========  =================================
419
Operation  Description
420
=========  =================================
421
HEAD       Retrieve object metadata
422
GET        Read object data
423
PUT        Write object data or copy/move object
424
COPY       Copy object
425
MOVE       Move object
426
POST       Update object metadata/data
427
DELETE     Delete object
428
=========  =================================
429

    
430

    
431
HEAD
432
""""
433

    
434
======================  ===================================
435
Request Parameter Name  Value
436
======================  ===================================
437
version                 Optional version identifier
438
======================  ===================================
439

    
440
|
441

    
442
==========================  ===============================
443
Reply Header Name           Value
444
==========================  ===============================
445
ETag                        The ETag of the object
446
Content-Length              The size of the object
447
Content-Type                The MIME content type of the object
448
Last-Modified               The last object modification date (regardless of version)
449
Content-Encoding            The encoding of the object (optional)
450
Content-Disposition         The presentation style of the object (optional)
451
X-Object-Version            The object's version identifier
452
X-Object-Version-Timestamp  The object's version timestamp
453
X-Object-Modified-By        The user that comitted the object's version
454
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
455
X-Object-Sharing            Object permissions (optional)
456
X-Object-Shared-By          Object inheriting permissions (optional)
457
X-Object-Public             Object's publicly accessible URI (optional)
458
X-Object-Meta-*             Optional user defined metadata
459
==========================  ===============================
460

    
461
|
462

    
463
================  ===============================
464
Return Code       Description
465
================  ===============================
466
200 (No Content)  The request succeeded
467
================  ===============================
468

    
469

    
470
GET
471
"""
472

    
473
====================  ================================
474
Request Header Name   Value
475
====================  ================================
476
Range                 Optional range of data to retrieve
477
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
478
If-Match              Retrieve if ETags match
479
If-None-Match         Retrieve if ETags don't match
480
If-Modified-Since     Retrieve if object has changed since provided timestamp
481
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
482
====================  ================================
483

    
484
|
485

    
486
======================  ===================================
487
Request Parameter Name  Value
488
======================  ===================================
489
format                  Optional extended reply type (can be ``json`` or ``xml``)
490
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
491
======================  ===================================
492

    
493
The reply is the object's data (or part of it), except if a hashmap is requested with the ``format`` parameter, or a version list with ``version=list`` (in which case an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
494

    
495
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block.
496

    
497
Example ``format=json`` reply:
498

    
499
::
500

    
501
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
502

    
503
Example ``format=xml`` reply:
504

    
505
::
506

    
507
  <?xml version="1.0" encoding="UTF-8"?>
508
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
509
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
510
    <hash>...</hash>
511
  </object>
512

    
513
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
514

    
515
Example ``format=json`` reply:
516

    
517
::
518

    
519
  {"versions": [[23, 1307700892], [28, 1307700898], ...]}
520

    
521
Example ``format=xml`` reply:
522

    
523
::
524

    
525
  <?xml version="1.0" encoding="UTF-8"?>
526
  <object name="file">
527
    <version timestamp="1307700892">23</version>
528
    <version timestamp="1307700898">28</version>
529
    <version timestamp="...">...</version>
530
  </object>
531

    
532
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
533

    
534
==========================  ===============================
535
Reply Header Name           Value
536
==========================  ===============================
537
ETag                        The ETag of the object
538
Content-Length              The size of the data returned
539
Content-Type                The MIME content type of the object
540
Content-Range               The range of data included (only on a single range request)
541
Last-Modified               The last object modification date (regardless of version)
542
Content-Encoding            The encoding of the object (optional)
543
Content-Disposition         The presentation style of the object (optional)
544
X-Object-Version            The object's version identifier
545
X-Object-Version-Timestamp  The object's version timestamp
546
X-Object-Modified-By        The user that comitted the object's version
547
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
548
X-Object-Sharing            Object permissions (optional)
549
X-Object-Shared-By          Object inheriting permissions (optional)
550
X-Object-Public             Object's publicly accessible URI (optional)
551
X-Object-Meta-*             Optional user defined metadata
552
==========================  ===============================
553

    
554
|
555

    
556
===========================  ==============================
557
Return Code                  Description
558
===========================  ==============================
559
200 (OK)                     The request succeeded
560
206 (Partial Content)        The range request succeeded
561
304 (Not Modified)           The object has not been modified
562
412 (Precondition Failed)    The condition set can not be satisfied
563
416 (Range Not Satisfiable)  The requested range is out of limits
564
===========================  ==============================
565

    
566

    
567
PUT
568
"""
569

    
570
====================  ================================
571
Request Header Name   Value
572
====================  ================================
573
ETag                  The MD5 hash of the object (optional to check written data)
574
Content-Length        The size of the data written
575
Content-Type          The MIME content type of the object
576
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
577
X-Copy-From           The source path in the form ``/<container>/<object>``
578
X-Move-From           The source path in the form ``/<container>/<object>``
579
X-Source-Version      The source version to copy from
580
Content-Encoding      The encoding of the object (optional)
581
Content-Disposition   The presentation style of the object (optional)
582
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
583
X-Object-Sharing      Object permissions (optional)
584
X-Object-Public       Object is publicly accessible (optional)
585
X-Object-Meta-*       Optional user defined metadata
586
====================  ================================
587

    
588
|
589

    
590
======================  ===================================
591
Request Parameter Name  Value
592
======================  ===================================
593
format                  Optional extended request type (can be ``json``) to create the object by suppling its hashmap instead
594
======================  ===================================
595

    
596
The request is the object's data (or part of it), except if a hashmap is provided with the ``format`` parameter.  If format is used and all different parts are stored in the server, the object is created otherwise the server returns Conflict (409) with the list of the missing parts. 
597

    
598
Hashmaps expose the underlying storage format of the object.
599

    
600
Example ``format=json`` request:
601

    
602
::
603

    
604
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
605

    
606

    
607
==========================  ===============================
608
Reply Header Name           Value
609
==========================  ===============================
610
ETag                        The MD5 hash of the object (on create)
611
==========================  ===============================
612

    
613
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
614

    
615
===========================  ==============================
616
Return Code                  Description
617
===========================  ==============================
618
201 (Created)                The object has been created
619
409 (Conflict)               The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a conflicting sharing path will be included in the reply - in JSON format)
620
411 (Length Required)        Missing ``Content-Length`` or ``Content-Type`` in the request
621
422 (Unprocessable Entity)   The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
622
===========================  ==============================
623

    
624

    
625
COPY
626
""""
627

    
628
====================  ================================
629
Request Header Name   Value
630
====================  ================================
631
Destination           The destination path in the form ``/<container>/<object>``
632
Content-Type          The MIME content type of the object (optional)
633
Content-Encoding      The encoding of the object (optional)
634
Content-Disposition   The presentation style of the object (optional)
635
X-Source-Version      The source version to copy from
636
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
637
X-Object-Sharing      Object permissions (optional)
638
X-Object-Public       Object is publicly accessible (optional)
639
X-Object-Meta-*       Optional user defined metadata
640
====================  ================================
641

    
642
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
643

    
644
No reply content/headers.
645

    
646
===========================  ==============================
647
Return Code                  Description
648
===========================  ==============================
649
201 (Created)                The object has been created
650
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
651
===========================  ==============================
652

    
653

    
654
MOVE
655
""""
656

    
657
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
658

    
659

    
660
POST
661
""""
662

    
663
======================  ============================================
664
Request Parameter Name  Value
665
======================  ============================================
666
update                  Do not replace metadata (no value parameter)
667
======================  ============================================
668

    
669
|
670

    
671
====================  ================================
672
Request Header Name   Value
673
====================  ================================
674
Content-Length        The size of the data written (optional, to update)
675
Content-Type          The MIME content type of the object (optional, to update)
676
Content-Range         The range of data supplied (optional, to update)
677
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
678
Content-Encoding      The encoding of the object (optional)
679
Content-Disposition   The presentation style of the object (optional)
680
X-Source-Object       Update with data from the object at path ``/<container>/<object>`` (optional, to update)
681
X-Source-Version      The source version to update from (optional, to update)
682
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
683
X-Object-Sharing      Object permissions (optional)
684
X-Object-Public       Object is publicly accessible (optional)
685
X-Object-Meta-*       Optional user defined metadata
686
====================  ================================
687

    
688
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
689

    
690
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
691

    
692
To update an object's data:
693

    
694
* Either set ``Content-Type`` to ``application/octet-stream``, or provide an object with ``X-Source-Object``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
695
* If the data is supplied in the request (using ``Content-Type`` instead of ``X-Source-Object``), a valid ``Content-Length`` header is required - except if using chunked transfers (set ``Transfer-Encoding`` to ``chunked``).
696
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
697

    
698
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
699
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
700
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data will be appended to the object.
701

    
702
A data update will trigger an ETag change. The new ETag will not correspond to the object's MD5 sum (**TBD**) and will be included in reply headers.
703

    
704
No reply content. No reply headers if only metadata is updated.
705

    
706
==========================  ===============================
707
Reply Header Name           Value
708
==========================  ===============================
709
ETag                        The new ETag of the object (data updated)
710
==========================  ===============================
711

    
712
|
713

    
714
===========================  ==============================
715
Return Code                  Description
716
===========================  ==============================
717
202 (Accepted)               The request has been accepted (not a data update)
718
204 (No Content)             The request succeeded (data updated)
719
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
720
411 (Length Required)        Missing ``Content-Length`` in the request
721
416 (Range Not Satisfiable)  The supplied range is invalid
722
===========================  ==============================
723

    
724

    
725
DELETE
726
""""""
727

    
728
No request parameters/headers.
729

    
730
No reply content/headers.
731

    
732
===========================  ==============================
733
Return Code                  Description
734
===========================  ==============================
735
204 (No Content)             The request succeeded
736
===========================  ==============================
737

    
738
Sharing and Public Objects
739
^^^^^^^^^^^^^^^^^^^^^^^^^^
740

    
741
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
742

    
743
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden).
744

    
745
==========================  ===============================
746
Reply Header Name           Value
747
==========================  ===============================
748
ETag                        The ETag of the object
749
Content-Length              The size of the data returned
750
Content-Type                The MIME content type of the object
751
Content-Range               The range of data included (only on a single range request)
752
Last-Modified               The last object modification date (regardless of version)
753
Content-Encoding            The encoding of the object (optional)
754
Content-Disposition         The presentation style of the object (optional)
755
==========================  ===============================
756

    
757
Summary
758
^^^^^^^
759

    
760
List of differences from the OOS API:
761

    
762
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
763
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
764
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings.
765
* Container policies to manage behavior and limits.
766
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
767
* All metadata replies, at all levels, include latest modification information.
768
* At all levels, a ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
769
* Container/object lists include all associated metadata if the reply is of type json/xml. Some names are kept to their OOS API equivalents for compatibility. 
770
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
771
* Multi-range object GET support as outlined in RFC2616.
772
* Object hashmap retrieval through GET and the ``format`` parameter.
773
* Partial object updates through POST, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers. Use another object's data to update with ``X-Source-Object`` and ``X-Source-Version``.
774
* Object ``MOVE`` support.
775
* Time-variant account/container listings via the ``until`` parameter.
776
* Object versions - parameter ``version`` in HEAD/GET (list versions with GET), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in PUT/COPY.
777
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
778
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
779
* Large object support with ``X-Object-Manifest``.
780
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
781

    
782
Clarifications/suggestions:
783

    
784
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
785
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
786
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
787
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
788
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired reply format. The parameter overrides the header.
789
* Container/object lists use a ``200`` return code if the reply is of type json/xml. The reply will include an empty json/xml.
790
* In headers, dates are formatted according to RFC 1123. In extended information listings, dates are formatted according to ISO 8601.
791
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
792
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
793
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
794

    
795
The Pithos Client
796
-----------------
797

    
798
User Experience
799
^^^^^^^^^^^^^^^
800

    
801
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
802

    
803
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
804

    
805
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
806
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
807
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
808
* The ``others`` element, which contains all objects that other users share with the user.
809
* The ``tags`` element, which lists the names of tags the user has defined. This can be an entry point to list all files that have been assigned a specific tag or manage tags in general (remove a tag completely, rename a tag etc.).
810
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
811

    
812
Objects in Pithos can be:
813

    
814
* Assigned custom tags.
815
* Moved to trash and then deleted.
816
* Shared with specific permissions.
817
* Made public (shared with non-Pithos users).
818
* Restored from previous versions.
819

    
820
Some of these functions are performed by the client software and some by the Pithos server. Client-driven functionality is based on specific metadata that should be handled equally across implementations. These metadata names are discussed in the next chapter. 
821

    
822
Conventions and Metadata Specification
823
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
824

    
825
Pithos clients should use the ``pithos`` container for all Pithos objects. Object names use the ``/`` delimiter to impose a hierarchy of folders and files.
826

    
827
At the object level, tags are implemented by managing metadata keys. The client software should allow the user to use any string as a tag and then set the corresponding ``X-Object-Meta-<tag>`` key at the server. The API extensions provided, allow for listing all tags in a container and filtering object listings based on one or more tags. The tag list is sufficient for implementing the ``tags`` element, either as a special, virtual folder (as done in the first version of Pithos), or as an application menu.
828

    
829
The metadata specification is summarized in the following table.
830

    
831
===========================  ==============================
832
Metadata Name                Value
833
===========================  ==============================
834
X-Object-Meta-*              Use for other tags that apply to the object
835
===========================  ==============================
836

    
837
Recommended Practices and Examples
838
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
839

    
840
Assuming an authentication token is obtained (**TBD**), the following high-level operations are available - shown with ``curl``:
841

    
842
* Get account information ::
843

    
844
    curl -X HEAD -D - \
845
         -H "X-Auth-Token: 0000" \
846
         https://pithos.dev.grnet.gr/v1/user
847

    
848
* List available containers ::
849

    
850
    curl -X GET -D - \
851
         -H "X-Auth-Token: 0000" \
852
         https://pithos.dev.grnet.gr/v1/user
853

    
854
* Get container information ::
855

    
856
    curl -X HEAD -D - \
857
         -H "X-Auth-Token: 0000" \
858
         https://pithos.dev.grnet.gr/v1/user/pithos
859

    
860
* Add a new container ::
861

    
862
    curl -X PUT -D - \
863
         -H "X-Auth-Token: 0000" \
864
         https://pithos.dev.grnet.gr/v1/user/test
865

    
866
* Delete a container ::
867

    
868
    curl -X DELETE -D - \
869
         -H "X-Auth-Token: 0000" \
870
         https://pithos.dev.grnet.gr/v1/user/test
871

    
872
* List objects in a container ::
873

    
874
    curl -X GET -D - \
875
         -H "X-Auth-Token: 0000" \
876
         https://pithos.dev.grnet.gr/v1/user/pithos
877

    
878
* List objects in a container (extended reply) ::
879

    
880
    curl -X GET -D - \
881
         -H "X-Auth-Token: 0000" \
882
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
883

    
884
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
885

    
886
* List metadata keys used by objects in a container
887

    
888
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``).
889

    
890
* List objects in a container having a specific meta defined ::
891

    
892
    curl -X GET -D - \
893
         -H "X-Auth-Token: 0000" \
894
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=favorites
895

    
896
* Retrieve an object ::
897

    
898
    curl -X GET -D - \
899
         -H "X-Auth-Token: 0000" \
900
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
901

    
902
* Retrieve an object (specific ranges of data) ::
903

    
904
    curl -X GET -D - \
905
         -H "X-Auth-Token: 0000" \
906
         -H "Range: bytes=0-9" \
907
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
908

    
909
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
910

    
911
* Add a new object (folder type) (**TBD**) ::
912

    
913
    curl -X PUT -D - \
914
         -H "X-Auth-Token: 0000" \
915
         -H "Content-Type: application/folder" \
916
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
917

    
918
* Add a new object ::
919

    
920
    curl -X PUT -D - \
921
         -H "X-Auth-Token: 0000" \
922
         -H "Content-Type: text/plain" \
923
         -T EXAMPLE.txt
924
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
925

    
926
* Update an object ::
927

    
928
    curl -X POST -D - \
929
         -H "X-Auth-Token: 0000" \
930
         -H "Content-Length: 10" \
931
         -H "Content-Type: application/octet-stream" \
932
         -H "Content-Range: bytes 10-19/*" \
933
         -d "0123456789" \
934
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
935

    
936
  This will update bytes 10-19 with the data specified.
937

    
938
* Update an object (append) ::
939

    
940
    curl -X POST -D - \
941
         -H "X-Auth-Token: 0000" \
942
         -H "Content-Length: 10" \
943
         -H "Content-Type: application/octet-stream" \
944
         -H "Content-Range: bytes */*" \
945
         -d "0123456789" \
946
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
947

    
948
* Add object metadata ::
949

    
950
    curl -X POST -D - \
951
         -H "X-Auth-Token: 0000" \
952
         -H "X-Object-Meta-First: first_meta_value" \
953
         -H "X-Object-Meta-Second: second_meta_value" \
954
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
955

    
956
* Delete object metadata ::
957

    
958
    curl -X POST -D - \
959
         -H "X-Auth-Token: 0000" \
960
         -H "X-Object-Meta-First: first_meta_value" \
961
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
962

    
963
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
964

    
965
* Delete an object ::
966

    
967
    curl -X DELETE -D - \
968
         -H "X-Auth-Token: 0000" \
969
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt