Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ b0a2d1a6

History | View | Annotate | Download (50.8 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
Document Revisions
23
^^^^^^^^^^^^^^^^^^
24

    
25
=========================  ================================
26
Revision                   Description
27
=========================  ================================
28
0.5 (July 16, 2011)        Object update from another object's data.
29
\                          Support object truncate.
30
\                          Create object using a standard HTML form.
31
\                          Purge container/object history.
32
0.4 (July 01, 2011)        Object permissions and account groups.
33
\                          Control versioning behavior and container quotas with container policy directives.
34
\                          Support updating/deleting individual metadata with ``POST``.
35
\                          Create object using hashmap.
36
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
37
\                          Allow for publicly available objects via ``https://hostname/public``.
38
\                          Support time-variant account/container listings. 
39
\                          Add source version when duplicating with ``PUT``/``COPY``.
40
\                          Request version in object ``HEAD``/``GET`` requests (list versions with ``GET``).
41
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
42
\                          Include underlying storage characteristics in container meta.
43
\                          Support for partial object updates through ``POST``.
44
\                          Expose object hashmaps through ``GET``.
45
\                          Support for multi-range object ``GET`` requests.
46
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
47
=========================  ================================
48

    
49
The Pithos API
50
--------------
51

    
52
The URI requests supported by the Pithos API follow one of the following forms:
53

    
54
* Top level: ``https://hostname/v1/``
55
* Account level: ``https://hostname/v1/<account>``
56
* Container level: ``https://hostname/v1/<account>/<container>``
57
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
58

    
59
All requests must include an ``X-Auth-Token`` - as a header, or a parameter. The process of obtaining the token is still to be determined (**TBD**).
60

    
61
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
62

    
63
=========================  ================================
64
Return Code                Description
65
=========================  ================================
66
400 (Bad Request)          The request is invalid
67
401 (Unauthorized)         Request not allowed
68
404 (Not Found)            The requested resource was not found
69
503 (Service Unavailable)  The request cannot be completed because of an internal error
70
=========================  ================================
71

    
72
Top Level
73
^^^^^^^^^
74

    
75
List of operations:
76

    
77
=========  ==================
78
Operation  Description
79
=========  ==================
80
GET        Authentication. This is kept for compatibility with the OOS API
81
=========  ==================
82

    
83
GET
84
"""
85

    
86
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
87

    
88
================  =====================
89
Return Code       Description
90
================  =====================
91
204 (No Content)  The request succeeded
92
================  =====================
93

    
94

    
95
Account Level
96
^^^^^^^^^^^^^
97

    
98
List of operations:
99

    
100
=========  ==================
101
Operation  Description
102
=========  ==================
103
HEAD       Retrieve account metadata
104
GET        List containers
105
POST       Update account metadata
106
=========  ==================
107

    
108
HEAD
109
""""
110

    
111
======================  ===================================
112
Request Parameter Name  Value
113
======================  ===================================
114
until                   Optional timestamp
115
======================  ===================================
116

    
117
|
118

    
119
==========================  =====================
120
Reply Header Name           Value
121
==========================  =====================
122
X-Account-Container-Count   The total number of containers
123
X-Account-Object-Count      The total number of objects (**TBD**)
124
X-Account-Bytes-Used        The total number of bytes stored
125
X-Account-Bytes-Remaining   The total number of bytes remaining (**TBD**)
126
X-Account-Last-Login        The last login (**TBD**)
127
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
128
X-Account-Group-*           Optional user defined groups
129
X-Account-Meta-*            Optional user defined metadata
130
Last-Modified               The last account modification date (regardless of ``until``)
131
==========================  =====================
132

    
133
|
134

    
135
================  =====================
136
Return Code       Description
137
================  =====================
138
204 (No Content)  The request succeeded
139
================  =====================
140

    
141

    
142
GET
143
"""
144

    
145
====================  ===========================
146
Request Header Name   Value
147
====================  ===========================
148
If-Modified-Since     Retrieve if account has changed since provided timestamp
149
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
150
====================  ===========================
151

    
152
|
153

    
154
======================  =========================
155
Request Parameter Name  Value
156
======================  =========================
157
limit                   The amount of results requested (default is 10000)
158
marker                  Return containers with name lexicographically after marker
159
format                  Optional extended reply type (can be ``json`` or ``xml``)
160
until                   Optional timestamp
161
======================  =========================
162

    
163
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
164
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
165
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
166

    
167
===========================  ============================
168
Name                         Description
169
===========================  ============================
170
name                         The name of the container
171
count                        The number of objects inside the container
172
bytes                        The total size of the objects inside the container
173
last_modified                The last container modification date (regardless of ``until``)
174
x_container_until_timestamp  The last container modification date until the timestamp provided
175
x_container_policy_*         Container behavior and limits
176
x_container_meta_*           Optional user defined metadata
177
===========================  ============================
178

    
179
For examples of container details returned in JSON/XML formats refer to the OOS API documentation.
180

    
181
===========================  =====================
182
Return Code                  Description
183
===========================  =====================
184
200 (OK)                     The request succeeded
185
204 (No Content)             The account has no containers (only for non-extended replies)
186
304 (Not Modified)           The account has not been modified
187
412 (Precondition Failed)    The condition set can not be satisfied
188
===========================  =====================
189

    
190
Will use a ``200`` return code if the reply is of type json/xml.
191

    
192

    
193
POST
194
""""
195

    
196
======================  ============================================
197
Request Parameter Name  Value
198
======================  ============================================
199
update                  Do not replace metadata/groups (no value parameter)
200
======================  ============================================
201

    
202
|
203

    
204
====================  ===========================
205
Request Header Name   Value
206
====================  ===========================
207
X-Account-Group-*     Optional user defined groups
208
X-Account-Meta-*      Optional user defined metadata
209
====================  ===========================
210

    
211
No reply content/headers.
212

    
213
The operation will overwrite all user defined metadata, except if ``update`` is defined.
214
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
215

    
216
================  ===============================
217
Return Code       Description
218
================  ===============================
219
202 (Accepted)    The request has been accepted
220
================  ===============================
221

    
222

    
223
Container Level
224
^^^^^^^^^^^^^^^
225

    
226
List of operations:
227

    
228
=========  ============================
229
Operation  Description
230
=========  ============================
231
HEAD       Retrieve container metadata
232
GET        List objects
233
PUT        Create/update container
234
POST       Update container metadata
235
DELETE     Delete container
236
=========  ============================
237

    
238

    
239
HEAD
240
""""
241

    
242
======================  ===================================
243
Request Parameter Name  Value
244
======================  ===================================
245
until                   Optional timestamp
246
======================  ===================================
247

    
248
|
249

    
250
===========================  ===============================
251
Reply Header Name            Value
252
===========================  ===============================
253
X-Container-Object-Count     The total number of objects in the container
254
X-Container-Bytes-Used       The total number of bytes of all objects stored
255
X-Container-Block-Size       The block size used by the storage backend
256
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
257
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
258
X-Container-Object-Meta      A list with all meta keys used by objects
259
X-Container-Policy-*         Container behavior and limits
260
X-Container-Meta-*           Optional user defined metadata
261
Last-Modified                The last container modification date (regardless of ``until``)
262
===========================  ===============================
263

    
264
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix. See container ``PUT`` for a reference of policy directives.
265

    
266
================  ===============================
267
Return Code       Description
268
================  ===============================
269
204 (No Content)  The request succeeded
270
================  ===============================
271

    
272

    
273
GET
274
"""
275

    
276
====================  ===========================
277
Request Header Name   Value
278
====================  ===========================
279
If-Modified-Since     Retrieve if container has changed since provided timestamp
280
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
281
====================  ===========================
282

    
283
|
284

    
285
======================  ===================================
286
Request Parameter Name  Value
287
======================  ===================================
288
limit                   The amount of results requested (default is 10000)
289
marker                  Return containers with name lexicographically after marker
290
prefix                  Return objects starting with prefix
291
delimiter               Return objects up to the delimiter (discussion follows)
292
path                    Assume ``prefix=path`` and ``delimiter=/``
293
format                  Optional extended reply type (can be ``json`` or ``xml``)
294
meta                    Return objects having the specified meta keys (can be a comma separated list)
295
until                   Optional timestamp
296
======================  ===================================
297

    
298
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
299

    
300
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
301

    
302
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
303
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
304
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
305

    
306
==========================  ======================================
307
Name                        Description
308
==========================  ======================================
309
name                        The name of the object
310
hash                        The ETag of the object
311
bytes                       The size of the object
312
content_type                The MIME content type of the object
313
content_encoding            The encoding of the object (optional)
314
content-disposition         The presentation style of the object (optional)
315
last_modified               The last object modification date (regardless of version)
316
x_object_version            The object's version identifier
317
x_object_version_timestamp  The object's version timestamp
318
x_object_modified_by        The user that committed the object's version
319
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
320
x_object_sharing            Object permissions (optional)
321
x_object_shared_by          Object inheriting permissions (optional)
322
x_object_public             Object's publicly accessible URI (optional)
323
x_object_meta_*             Optional user defined metadata
324
==========================  ======================================
325

    
326
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
327
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
328
In JSON results they appear as dictionaries with only a ``"subdir"`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
329
In case there is an object with the same name as a virtual directory marker, the object will be returned.
330

    
331
For examples of object details returned in JSON/XML formats refer to the OOS API documentation.
332

    
333
===========================  ===============================
334
Return Code                  Description
335
===========================  ===============================
336
200 (OK)                     The request succeeded
337
204 (No Content)             The account has no containers (only for non-extended replies)
338
304 (Not Modified)           The container has not been modified
339
412 (Precondition Failed)    The condition set can not be satisfied
340
===========================  ===============================
341

    
342
Will use a ``200`` return code if the reply is of type json/xml.
343

    
344

    
345
PUT
346
"""
347

    
348
====================  ================================
349
Request Header Name   Value
350
====================  ================================
351
X-Container-Policy-*  Container behavior and limits
352
X-Container-Meta-*    Optional user defined metadata
353
====================  ================================
354
 
355
No reply content/headers.
356

    
357
If no policy is defined, the container will be created with the default values.
358
Available policy directives:
359

    
360
* ``versioning``: Set to ``auto``, ``manual`` or ``none`` (default is ``manual``)
361
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
362
 
363
================  ===============================
364
Return Code       Description
365
================  ===============================
366
201 (Created)     The container has been created
367
202 (Accepted)    The request has been accepted
368
================  ===============================
369

    
370

    
371
POST
372
""""
373

    
374
======================  ============================================
375
Request Parameter Name  Value
376
======================  ============================================
377
update                  Do not replace metadata/policy (no value parameter)
378
======================  ============================================
379

    
380
|
381

    
382
====================  ================================
383
Request Header Name   Value
384
====================  ================================
385
X-Container-Policy-*  Container behavior and limits
386
X-Container-Meta-*    Optional user defined metadata
387
====================  ================================
388

    
389
No reply content/headers.
390

    
391
The operation will overwrite all user defined metadata, except if ``update`` is defined.
392
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
393

    
394
================  ===============================
395
Return Code       Description
396
================  ===============================
397
202 (Accepted)    The request has been accepted
398
================  ===============================
399

    
400

    
401
DELETE
402
""""""
403

    
404
======================  ===================================
405
Request Parameter Name  Value
406
======================  ===================================
407
until                   Optional timestamp
408
======================  ===================================
409

    
410
If ``until`` is defined, the container is "purged" up to that time (the history of all objects up to then is deleted).
411

    
412
No reply content/headers.
413

    
414
================  ===============================
415
Return Code       Description
416
================  ===============================
417
204 (No Content)  The request succeeded
418
409 (Conflict)    The container is not empty
419
================  ===============================
420

    
421

    
422
Object Level
423
^^^^^^^^^^^^
424

    
425
List of operations:
426

    
427
=========  =================================
428
Operation  Description
429
=========  =================================
430
HEAD       Retrieve object metadata
431
GET        Read object data
432
PUT        Write object data or copy/move object
433
COPY       Copy object
434
MOVE       Move object
435
POST       Update object metadata/data
436
DELETE     Delete object
437
=========  =================================
438

    
439

    
440
HEAD
441
""""
442

    
443
======================  ===================================
444
Request Parameter Name  Value
445
======================  ===================================
446
version                 Optional version identifier
447
======================  ===================================
448

    
449
|
450

    
451
==========================  ===============================
452
Reply Header Name           Value
453
==========================  ===============================
454
ETag                        The ETag of the object
455
Content-Length              The size of the object
456
Content-Type                The MIME content type of the object
457
Last-Modified               The last object modification date (regardless of version)
458
Content-Encoding            The encoding of the object (optional)
459
Content-Disposition         The presentation style of the object (optional)
460
X-Object-Version            The object's version identifier
461
X-Object-Version-Timestamp  The object's version timestamp
462
X-Object-Modified-By        The user that comitted the object's version
463
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
464
X-Object-Sharing            Object permissions (optional)
465
X-Object-Shared-By          Object inheriting permissions (optional)
466
X-Object-Public             Object's publicly accessible URI (optional)
467
X-Object-Meta-*             Optional user defined metadata
468
==========================  ===============================
469

    
470
|
471

    
472
================  ===============================
473
Return Code       Description
474
================  ===============================
475
200 (No Content)  The request succeeded
476
================  ===============================
477

    
478

    
479
GET
480
"""
481

    
482
====================  ================================
483
Request Header Name   Value
484
====================  ================================
485
Range                 Optional range of data to retrieve
486
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
487
If-Match              Retrieve if ETags match
488
If-None-Match         Retrieve if ETags don't match
489
If-Modified-Since     Retrieve if object has changed since provided timestamp
490
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
491
====================  ================================
492

    
493
|
494

    
495
======================  ===================================
496
Request Parameter Name  Value
497
======================  ===================================
498
format                  Optional extended reply type (can be ``json`` or ``xml``)
499
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
500
======================  ===================================
501

    
502
The reply is the object's data (or part of it), except if a hashmap is requested with the ``format`` parameter, or a version list with ``version=list`` (in which case an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
503

    
504
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block.
505

    
506
Example ``format=json`` reply:
507

    
508
::
509

    
510
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
511

    
512
Example ``format=xml`` reply:
513

    
514
::
515

    
516
  <?xml version="1.0" encoding="UTF-8"?>
517
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
518
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
519
    <hash>...</hash>
520
  </object>
521

    
522
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
523

    
524
Example ``format=json`` reply:
525

    
526
::
527

    
528
  {"versions": [[23, 1307700892], [28, 1307700898], ...]}
529

    
530
Example ``format=xml`` reply:
531

    
532
::
533

    
534
  <?xml version="1.0" encoding="UTF-8"?>
535
  <object name="file">
536
    <version timestamp="1307700892">23</version>
537
    <version timestamp="1307700898">28</version>
538
    <version timestamp="...">...</version>
539
  </object>
540

    
541
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
542

    
543
==========================  ===============================
544
Reply Header Name           Value
545
==========================  ===============================
546
ETag                        The ETag of the object
547
Content-Length              The size of the data returned
548
Content-Type                The MIME content type of the object
549
Content-Range               The range of data included (only on a single range request)
550
Last-Modified               The last object modification date (regardless of version)
551
Content-Encoding            The encoding of the object (optional)
552
Content-Disposition         The presentation style of the object (optional)
553
X-Object-Version            The object's version identifier
554
X-Object-Version-Timestamp  The object's version timestamp
555
X-Object-Modified-By        The user that comitted the object's version
556
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
557
X-Object-Sharing            Object permissions (optional)
558
X-Object-Shared-By          Object inheriting permissions (optional)
559
X-Object-Public             Object's publicly accessible URI (optional)
560
X-Object-Meta-*             Optional user defined metadata
561
==========================  ===============================
562

    
563
|
564

    
565
===========================  ==============================
566
Return Code                  Description
567
===========================  ==============================
568
200 (OK)                     The request succeeded
569
206 (Partial Content)        The range request succeeded
570
304 (Not Modified)           The object has not been modified
571
412 (Precondition Failed)    The condition set can not be satisfied
572
416 (Range Not Satisfiable)  The requested range is out of limits
573
===========================  ==============================
574

    
575

    
576
PUT
577
"""
578

    
579
====================  ================================
580
Request Header Name   Value
581
====================  ================================
582
ETag                  The MD5 hash of the object (optional to check written data)
583
Content-Length        The size of the data written
584
Content-Type          The MIME content type of the object
585
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
586
X-Copy-From           The source path in the form ``/<container>/<object>``
587
X-Move-From           The source path in the form ``/<container>/<object>``
588
X-Source-Version      The source version to copy from
589
Content-Encoding      The encoding of the object (optional)
590
Content-Disposition   The presentation style of the object (optional)
591
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
592
X-Object-Sharing      Object permissions (optional)
593
X-Object-Public       Object is publicly accessible (optional)
594
X-Object-Meta-*       Optional user defined metadata
595
====================  ================================
596

    
597
|
598

    
599
======================  ===================================
600
Request Parameter Name  Value
601
======================  ===================================
602
format                  Optional extended request type (can be ``json``) to create the object by suppling its hashmap instead
603
======================  ===================================
604

    
605
The request is the object's data (or part of it), except if a hashmap is provided with the ``format`` parameter.  If format is used and all different parts are stored in the server, the object is created, otherwise the server returns Conflict (409) with the list of the missing parts. 
606

    
607
Hashmaps expose the underlying storage format of the object.
608

    
609
Example ``format=json`` request:
610

    
611
::
612

    
613
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
614

    
615
Example ``format=xml`` request:
616

    
617
::
618

    
619
  <?xml version="1.0" encoding="UTF-8"?>
620
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
621
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
622
    <hash>...</hash>
623
  </object>
624

    
625
==========================  ===============================
626
Reply Header Name           Value
627
==========================  ===============================
628
ETag                        The MD5 hash of the object (on create)
629
==========================  ===============================
630

    
631
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). Groups are specified as ``<account>:<group>``. To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
632

    
633
===========================  ==============================
634
Return Code                  Description
635
===========================  ==============================
636
201 (Created)                The object has been created
637
409 (Conflict)               The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a conflicting sharing path will be included in the reply - in JSON format)
638
411 (Length Required)        Missing ``Content-Length`` or ``Content-Type`` in the request
639
422 (Unprocessable Entity)   The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
640
===========================  ==============================
641

    
642

    
643
COPY
644
""""
645

    
646
====================  ================================
647
Request Header Name   Value
648
====================  ================================
649
Destination           The destination path in the form ``/<container>/<object>``
650
Content-Type          The MIME content type of the object (optional)
651
Content-Encoding      The encoding of the object (optional)
652
Content-Disposition   The presentation style of the object (optional)
653
X-Source-Version      The source version to copy from
654
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
655
X-Object-Sharing      Object permissions (optional)
656
X-Object-Public       Object is publicly accessible (optional)
657
X-Object-Meta-*       Optional user defined metadata
658
====================  ================================
659

    
660
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
661

    
662
No reply content/headers.
663

    
664
===========================  ==============================
665
Return Code                  Description
666
===========================  ==============================
667
201 (Created)                The object has been created
668
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
669
===========================  ==============================
670

    
671

    
672
MOVE
673
""""
674

    
675
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
676

    
677

    
678
POST
679
""""
680

    
681
======================  ============================================
682
Request Parameter Name  Value
683
======================  ============================================
684
update                  Do not replace metadata (no value parameter)
685
======================  ============================================
686

    
687
|
688

    
689
====================  ================================
690
Request Header Name   Value
691
====================  ================================
692
Content-Length        The size of the data written (optional, to update)
693
Content-Type          The MIME content type of the object (optional, to update)
694
Content-Range         The range of data supplied (optional, to update)
695
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
696
Content-Encoding      The encoding of the object (optional)
697
Content-Disposition   The presentation style of the object (optional)
698
X-Source-Object       Update with data from the object at path ``/<container>/<object>`` (optional, to update)
699
X-Source-Version      The source version to update from (optional, to update)
700
X-Object-Bytes        The updated object's final size (optional, when updating)
701
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
702
X-Object-Sharing      Object permissions (optional)
703
X-Object-Public       Object is publicly accessible (optional)
704
X-Object-Meta-*       Optional user defined metadata
705
====================  ================================
706

    
707
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
708

    
709
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
710

    
711
To update an object's data:
712

    
713
* Either set ``Content-Type`` to ``application/octet-stream``, or provide an object with ``X-Source-Object``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
714
* If the data is supplied in the request (using ``Content-Type`` instead of ``X-Source-Object``), a valid ``Content-Length`` header is required - except if using chunked transfers (set ``Transfer-Encoding`` to ``chunked``).
715
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
716

    
717
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
718
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
719
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data will be appended to the object.
720

    
721
Optionally, truncate the updated object to the desired length with the ``X-Object-Bytes`` header.
722

    
723
A data update will trigger an ETag change. The new ETag will not correspond to the object's MD5 sum (**TBD**) and will be included in reply headers.
724

    
725
No reply content. No reply headers if only metadata is updated.
726

    
727
==========================  ===============================
728
Reply Header Name           Value
729
==========================  ===============================
730
ETag                        The new ETag of the object (data updated)
731
==========================  ===============================
732

    
733
|
734

    
735
===========================  ==============================
736
Return Code                  Description
737
===========================  ==============================
738
202 (Accepted)               The request has been accepted (not a data update)
739
204 (No Content)             The request succeeded (data updated)
740
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
741
411 (Length Required)        Missing ``Content-Length`` in the request
742
416 (Range Not Satisfiable)  The supplied range is invalid
743
===========================  ==============================
744

    
745
The ``POST`` method can also be used for creating an object via a standard HTML form. If the request ``Content-Type`` is ``multipart/form-data``, none of the above headers will be processed. The form should have exactly two fields, as in the following example. ::
746

    
747
  <form method="post" action="https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt" enctype="multipart/form-data">
748
    <input type="hidden" name="X-Auth-Token" value="0000">
749
    <input type="file" name="X-Object-Data">
750
    <input type="submit">
751
  </form>
752

    
753
This will create/override the object with the given name, as if using ``PUT``. The ``Content-Type`` of the object will be set to the value of the corresponding header sent in the part of the request containing the data. Metadata, sharing and other object attributes can not be set this way.
754

    
755
==========================  ===============================
756
Reply Header Name           Value
757
==========================  ===============================
758
ETag                        The MD5 hash of the object
759
==========================  ===============================
760

    
761
|
762

    
763
===========================  ==============================
764
Return Code                  Description
765
===========================  ==============================
766
201 (Created)                The object has been created
767
===========================  ==============================
768

    
769

    
770
DELETE
771
""""""
772

    
773
======================  ===================================
774
Request Parameter Name  Value
775
======================  ===================================
776
until                   Optional timestamp
777
======================  ===================================
778

    
779
If ``until`` is defined, the object is "purged" up to that time (the history up to then is deleted).
780

    
781
No reply content/headers.
782

    
783
===========================  ==============================
784
Return Code                  Description
785
===========================  ==============================
786
204 (No Content)             The request succeeded
787
===========================  ==============================
788

    
789
Sharing and Public Objects
790
^^^^^^^^^^^^^^^^^^^^^^^^^^
791

    
792
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
793

    
794
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden).
795

    
796
==========================  ===============================
797
Reply Header Name           Value
798
==========================  ===============================
799
ETag                        The ETag of the object
800
Content-Length              The size of the data returned
801
Content-Type                The MIME content type of the object
802
Content-Range               The range of data included (only on a single range request)
803
Last-Modified               The last object modification date (regardless of version)
804
Content-Encoding            The encoding of the object (optional)
805
Content-Disposition         The presentation style of the object (optional)
806
==========================  ===============================
807

    
808
Summary
809
^^^^^^^
810

    
811
List of differences from the OOS API:
812

    
813
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
814
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
815
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings.
816
* Container policies to manage behavior and limits.
817
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
818
* All metadata replies, at all levels, include latest modification information.
819
* At all levels, a ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
820
* Container/object lists include all associated metadata if the reply is of type json/xml. Some names are kept to their OOS API equivalents for compatibility. 
821
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
822
* Multi-range object ``GET`` support as outlined in RFC2616.
823
* Object hashmap retrieval through ``GET`` and the ``format`` parameter.
824
* Object create via hashmap through ``PUT`` and the ``format`` parameter.
825
* Object create using ``POST`` to support standard HTML forms.
826
* Partial object updates through ``POST``, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers. Use another object's data to update with ``X-Source-Object`` and ``X-Source-Version``. Truncate with ``X-Object-Bytes``.
827
* Object ``MOVE`` support.
828
* Time-variant account/container listings via the ``until`` parameter.
829
* Object versions - parameter ``version`` in ``HEAD``/``GET`` (list versions with ``GET``), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in ``PUT``/``COPY``.
830
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
831
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
832
* Large object support with ``X-Object-Manifest``.
833
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
834
* Purge container/object history with the ``until`` parameter in ``DELETE``.
835

    
836
Clarifications/suggestions:
837

    
838
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
839
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
840
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
841
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
842
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired reply format. The parameter overrides the header (**TBD**).
843
* Container/object lists use a ``200`` return code if the reply is of type json/xml. The reply will include an empty json/xml.
844
* In headers, dates are formatted according to RFC 1123. In extended information listings, dates are formatted according to ISO 8601.
845
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
846
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
847
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
848

    
849
The Pithos Client
850
-----------------
851

    
852
User Experience
853
^^^^^^^^^^^^^^^
854

    
855
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
856

    
857
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
858

    
859
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
860
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
861
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
862
* The ``others`` element, which contains all objects that other users share with the user.
863
* The ``tags`` element, which lists the names of tags the user has defined. This can be an entry point to list all files that have been assigned a specific tag or manage tags in general (remove a tag completely, rename a tag etc.).
864
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
865

    
866
Objects in Pithos can be:
867

    
868
* Assigned custom tags.
869
* Moved to trash and then deleted.
870
* Shared with specific permissions.
871
* Made public (shared with non-Pithos users).
872
* Restored from previous versions.
873

    
874
Some of these functions are performed by the client software and some by the Pithos server. Client-driven functionality is based on specific metadata that should be handled equally across implementations. These metadata names are discussed in the next chapter. 
875

    
876
Conventions and Metadata Specification
877
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
878

    
879
Pithos clients should use the ``pithos`` container for all Pithos objects. Object names use the ``/`` delimiter to impose a hierarchy of folders and files.
880

    
881
At the object level, tags are implemented by managing metadata keys. The client software should allow the user to use any string as a tag and then set the corresponding ``X-Object-Meta-<tag>`` key at the server. The API extensions provided, allow for listing all tags in a container and filtering object listings based on one or more tags. The tag list is sufficient for implementing the ``tags`` element, either as a special, virtual folder (as done in the first version of Pithos), or as an application menu.
882

    
883
The metadata specification is summarized in the following table.
884

    
885
===========================  ==============================
886
Metadata Name                Value
887
===========================  ==============================
888
X-Object-Meta-*              Use for other tags that apply to the object
889
===========================  ==============================
890

    
891
Recommended Practices and Examples
892
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
893

    
894
Assuming an authentication token is obtained (**TBD**), the following high-level operations are available - shown with ``curl``:
895

    
896
* Get account information ::
897

    
898
    curl -X HEAD -D - \
899
         -H "X-Auth-Token: 0000" \
900
         https://pithos.dev.grnet.gr/v1/user
901

    
902
* List available containers ::
903

    
904
    curl -X GET -D - \
905
         -H "X-Auth-Token: 0000" \
906
         https://pithos.dev.grnet.gr/v1/user
907

    
908
* Get container information ::
909

    
910
    curl -X HEAD -D - \
911
         -H "X-Auth-Token: 0000" \
912
         https://pithos.dev.grnet.gr/v1/user/pithos
913

    
914
* Add a new container ::
915

    
916
    curl -X PUT -D - \
917
         -H "X-Auth-Token: 0000" \
918
         https://pithos.dev.grnet.gr/v1/user/test
919

    
920
* Delete a container ::
921

    
922
    curl -X DELETE -D - \
923
         -H "X-Auth-Token: 0000" \
924
         https://pithos.dev.grnet.gr/v1/user/test
925

    
926
* List objects in a container ::
927

    
928
    curl -X GET -D - \
929
         -H "X-Auth-Token: 0000" \
930
         https://pithos.dev.grnet.gr/v1/user/pithos
931

    
932
* List objects in a container (extended reply) ::
933

    
934
    curl -X GET -D - \
935
         -H "X-Auth-Token: 0000" \
936
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
937

    
938
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
939

    
940
* List metadata keys used by objects in a container
941

    
942
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``).
943

    
944
* List objects in a container having a specific meta defined ::
945

    
946
    curl -X GET -D - \
947
         -H "X-Auth-Token: 0000" \
948
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=favorites
949

    
950
* Retrieve an object ::
951

    
952
    curl -X GET -D - \
953
         -H "X-Auth-Token: 0000" \
954
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
955

    
956
* Retrieve an object (specific ranges of data) ::
957

    
958
    curl -X GET -D - \
959
         -H "X-Auth-Token: 0000" \
960
         -H "Range: bytes=0-9" \
961
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
962

    
963
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
964

    
965
* Add a new object (folder type) (**TBD**) ::
966

    
967
    curl -X PUT -D - \
968
         -H "X-Auth-Token: 0000" \
969
         -H "Content-Type: application/folder" \
970
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
971

    
972
* Add a new object ::
973

    
974
    curl -X PUT -D - \
975
         -H "X-Auth-Token: 0000" \
976
         -H "Content-Type: text/plain" \
977
         -T EXAMPLE.txt
978
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
979

    
980
* Update an object ::
981

    
982
    curl -X POST -D - \
983
         -H "X-Auth-Token: 0000" \
984
         -H "Content-Length: 10" \
985
         -H "Content-Type: application/octet-stream" \
986
         -H "Content-Range: bytes 10-19/*" \
987
         -d "0123456789" \
988
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
989

    
990
  This will update bytes 10-19 with the data specified.
991

    
992
* Update an object (append) ::
993

    
994
    curl -X POST -D - \
995
         -H "X-Auth-Token: 0000" \
996
         -H "Content-Length: 10" \
997
         -H "Content-Type: application/octet-stream" \
998
         -H "Content-Range: bytes */*" \
999
         -d "0123456789" \
1000
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1001

    
1002
* Update an object (truncate) ::
1003

    
1004
    curl -X POST -D - \
1005
         -H "X-Auth-Token: 0000" \
1006
         -H "X-Source-Object: /folder/EXAMPLE.txt" \
1007
         -H "Content-Range: bytes 0-0/*" \
1008
         -H "X-Object-Bytes: 0" \
1009
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1010

    
1011
  This will truncate the object to 0 bytes.
1012

    
1013
* Add object metadata ::
1014

    
1015
    curl -X POST -D - \
1016
         -H "X-Auth-Token: 0000" \
1017
         -H "X-Object-Meta-First: first_meta_value" \
1018
         -H "X-Object-Meta-Second: second_meta_value" \
1019
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1020

    
1021
* Delete object metadata ::
1022

    
1023
    curl -X POST -D - \
1024
         -H "X-Auth-Token: 0000" \
1025
         -H "X-Object-Meta-First: first_meta_value" \
1026
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1027

    
1028
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
1029

    
1030
* Delete an object ::
1031

    
1032
    curl -X DELETE -D - \
1033
         -H "X-Auth-Token: 0000" \
1034
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt