Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ 1d5c57d3

History | View | Annotate | Download (49.7 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
Document Revisions
23
^^^^^^^^^^^^^^^^^^
24

    
25
=========================  ================================
26
Revision                   Description
27
=========================  ================================
28
0.5 (July 14, 2011)        Object update from another object's data.
29
\                          Support object truncate.
30
\                          Create object using a standard HTML form.
31
0.4 (July 01, 2011)        Object permissions and account groups.
32
\                          Control versioning behavior and container quotas with container policy directives.
33
\                          Support updating/deleting individual metadata with ``POST``.
34
\                          Create object using hashmap.
35
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
36
\                          Allow for publicly available objects via ``https://hostname/public``.
37
\                          Support time-variant account/container listings. 
38
\                          Add source version when duplicating with ``PUT``/``COPY``.
39
\                          Request version in object ``HEAD``/``GET`` requests (list versions with ``GET``).
40
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
41
\                          Include underlying storage characteristics in container meta.
42
\                          Support for partial object updates through ``POST``.
43
\                          Expose object hashmaps through ``GET``.
44
\                          Support for multi-range object ``GET`` requests.
45
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
46
=========================  ================================
47

    
48
The Pithos API
49
--------------
50

    
51
The URI requests supported by the Pithos API follow one of the following forms:
52

    
53
* Top level: ``https://hostname/v1/``
54
* Account level: ``https://hostname/v1/<account>``
55
* Container level: ``https://hostname/v1/<account>/<container>``
56
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
57

    
58
All requests must include an ``X-Auth-Token``. The process of obtaining the token is still to be determined (**TBD**).
59

    
60
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
61

    
62
=========================  ================================
63
Return Code                Description
64
=========================  ================================
65
400 (Bad Request)          The request is invalid
66
401 (Unauthorized)         Request not allowed
67
404 (Not Found)            The requested resource was not found
68
503 (Service Unavailable)  The request cannot be completed because of an internal error
69
=========================  ================================
70

    
71
Top Level
72
^^^^^^^^^
73

    
74
List of operations:
75

    
76
=========  ==================
77
Operation  Description
78
=========  ==================
79
GET        Authentication. This is kept for compatibility with the OOS API
80
=========  ==================
81

    
82
GET
83
"""
84

    
85
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
86

    
87
================  =====================
88
Return Code       Description
89
================  =====================
90
204 (No Content)  The request succeeded
91
================  =====================
92

    
93

    
94
Account Level
95
^^^^^^^^^^^^^
96

    
97
List of operations:
98

    
99
=========  ==================
100
Operation  Description
101
=========  ==================
102
HEAD       Retrieve account metadata
103
GET        List containers
104
POST       Update account metadata
105
=========  ==================
106

    
107
HEAD
108
""""
109

    
110
======================  ===================================
111
Request Parameter Name  Value
112
======================  ===================================
113
until                   Optional timestamp
114
======================  ===================================
115

    
116
|
117

    
118
==========================  =====================
119
Reply Header Name           Value
120
==========================  =====================
121
X-Account-Container-Count   The total number of containers
122
X-Account-Object-Count      The total number of objects (**TBD**)
123
X-Account-Bytes-Used        The total number of bytes stored
124
X-Account-Bytes-Remaining   The total number of bytes remaining (**TBD**)
125
X-Account-Last-Login        The last login (**TBD**)
126
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
127
X-Account-Group-*           Optional user defined groups
128
X-Account-Meta-*            Optional user defined metadata
129
Last-Modified               The last account modification date (regardless of ``until``)
130
==========================  =====================
131

    
132
|
133

    
134
================  =====================
135
Return Code       Description
136
================  =====================
137
204 (No Content)  The request succeeded
138
================  =====================
139

    
140

    
141
GET
142
"""
143

    
144
====================  ===========================
145
Request Header Name   Value
146
====================  ===========================
147
If-Modified-Since     Retrieve if account has changed since provided timestamp
148
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
149
====================  ===========================
150

    
151
|
152

    
153
======================  =========================
154
Request Parameter Name  Value
155
======================  =========================
156
limit                   The amount of results requested (default is 10000)
157
marker                  Return containers with name lexicographically after marker
158
format                  Optional extended reply type (can be ``json`` or ``xml``)
159
until                   Optional timestamp
160
======================  =========================
161

    
162
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
163
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
164
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
165

    
166
===========================  ============================
167
Name                         Description
168
===========================  ============================
169
name                         The name of the container
170
count                        The number of objects inside the container
171
bytes                        The total size of the objects inside the container
172
last_modified                The last container modification date (regardless of ``until``)
173
x_container_until_timestamp  The last container modification date until the timestamp provided
174
x_container_policy_*         Container behavior and limits
175
x_container_meta_*           Optional user defined metadata
176
===========================  ============================
177

    
178
For examples of container details returned in JSON/XML formats refer to the OOS API documentation.
179

    
180
===========================  =====================
181
Return Code                  Description
182
===========================  =====================
183
200 (OK)                     The request succeeded
184
204 (No Content)             The account has no containers (only for non-extended replies)
185
304 (Not Modified)           The account has not been modified
186
412 (Precondition Failed)    The condition set can not be satisfied
187
===========================  =====================
188

    
189
Will use a ``200`` return code if the reply is of type json/xml.
190

    
191

    
192
POST
193
""""
194

    
195
======================  ============================================
196
Request Parameter Name  Value
197
======================  ============================================
198
update                  Do not replace metadata/groups (no value parameter)
199
======================  ============================================
200

    
201
|
202

    
203
====================  ===========================
204
Request Header Name   Value
205
====================  ===========================
206
X-Account-Group-*     Optional user defined groups
207
X-Account-Meta-*      Optional user defined metadata
208
====================  ===========================
209

    
210
No reply content/headers.
211

    
212
The operation will overwrite all user defined metadata, except if ``update`` is defined.
213
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
214

    
215
================  ===============================
216
Return Code       Description
217
================  ===============================
218
202 (Accepted)    The request has been accepted
219
================  ===============================
220

    
221

    
222
Container Level
223
^^^^^^^^^^^^^^^
224

    
225
List of operations:
226

    
227
=========  ============================
228
Operation  Description
229
=========  ============================
230
HEAD       Retrieve container metadata
231
GET        List objects
232
PUT        Create/update container
233
POST       Update container metadata
234
DELETE     Delete container
235
=========  ============================
236

    
237

    
238
HEAD
239
""""
240

    
241
======================  ===================================
242
Request Parameter Name  Value
243
======================  ===================================
244
until                   Optional timestamp
245
======================  ===================================
246

    
247
|
248

    
249
===========================  ===============================
250
Reply Header Name            Value
251
===========================  ===============================
252
X-Container-Object-Count     The total number of objects in the container
253
X-Container-Bytes-Used       The total number of bytes of all objects stored
254
X-Container-Block-Size       The block size used by the storage backend
255
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
256
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
257
X-Container-Object-Meta      A list with all meta keys used by objects
258
X-Container-Policy-*         Container behavior and limits
259
X-Container-Meta-*           Optional user defined metadata
260
Last-Modified                The last container modification date (regardless of ``until``)
261
===========================  ===============================
262

    
263
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix. See container ``PUT`` for a reference of policy directives.
264

    
265
================  ===============================
266
Return Code       Description
267
================  ===============================
268
204 (No Content)  The request succeeded
269
================  ===============================
270

    
271

    
272
GET
273
"""
274

    
275
====================  ===========================
276
Request Header Name   Value
277
====================  ===========================
278
If-Modified-Since     Retrieve if container has changed since provided timestamp
279
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
280
====================  ===========================
281

    
282
|
283

    
284
======================  ===================================
285
Request Parameter Name  Value
286
======================  ===================================
287
limit                   The amount of results requested (default is 10000)
288
marker                  Return containers with name lexicographically after marker
289
prefix                  Return objects starting with prefix
290
delimiter               Return objects up to the delimiter (discussion follows)
291
path                    Assume ``prefix=path`` and ``delimiter=/``
292
format                  Optional extended reply type (can be ``json`` or ``xml``)
293
meta                    Return objects having the specified meta keys (can be a comma separated list)
294
until                   Optional timestamp
295
======================  ===================================
296

    
297
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
298

    
299
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
300

    
301
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
302
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
303
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
304

    
305
==========================  ======================================
306
Name                        Description
307
==========================  ======================================
308
name                        The name of the object
309
hash                        The ETag of the object
310
bytes                       The size of the object
311
content_type                The MIME content type of the object
312
content_encoding            The encoding of the object (optional)
313
content-disposition         The presentation style of the object (optional)
314
last_modified               The last object modification date (regardless of version)
315
x_object_version            The object's version identifier
316
x_object_version_timestamp  The object's version timestamp
317
x_object_modified_by        The user that committed the object's version
318
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
319
x_object_sharing            Object permissions (optional)
320
x_object_shared_by          Object inheriting permissions (optional)
321
x_object_public             Object's publicly accessible URI (optional)
322
x_object_meta_*             Optional user defined metadata
323
==========================  ======================================
324

    
325
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
326
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
327
In JSON results they appear as dictionaries with only a ``"subdir"`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
328
In case there is an object with the same name as a virtual directory marker, the object will be returned.
329

    
330
For examples of object details returned in JSON/XML formats refer to the OOS API documentation.
331

    
332
===========================  ===============================
333
Return Code                  Description
334
===========================  ===============================
335
200 (OK)                     The request succeeded
336
204 (No Content)             The account has no containers (only for non-extended replies)
337
304 (Not Modified)           The container has not been modified
338
412 (Precondition Failed)    The condition set can not be satisfied
339
===========================  ===============================
340

    
341
Will use a ``200`` return code if the reply is of type json/xml.
342

    
343

    
344
PUT
345
"""
346

    
347
====================  ================================
348
Request Header Name   Value
349
====================  ================================
350
X-Container-Policy-*  Container behavior and limits
351
X-Container-Meta-*    Optional user defined metadata
352
====================  ================================
353
 
354
No reply content/headers.
355

    
356
If no policy is defined, the container will be created with the default values.
357
Available policy directives:
358

    
359
* ``versioning``: Set to ``auto``, ``manual`` or ``none`` (default is ``manual``)
360
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
361
 
362
================  ===============================
363
Return Code       Description
364
================  ===============================
365
201 (Created)     The container has been created
366
202 (Accepted)    The request has been accepted
367
================  ===============================
368

    
369

    
370
POST
371
""""
372

    
373
======================  ============================================
374
Request Parameter Name  Value
375
======================  ============================================
376
update                  Do not replace metadata/policy (no value parameter)
377
======================  ============================================
378

    
379
|
380

    
381
====================  ================================
382
Request Header Name   Value
383
====================  ================================
384
X-Container-Policy-*  Container behavior and limits
385
X-Container-Meta-*    Optional user defined metadata
386
====================  ================================
387

    
388
No reply content/headers.
389

    
390
The operation will overwrite all user defined metadata, except if ``update`` is defined.
391
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
392

    
393
================  ===============================
394
Return Code       Description
395
================  ===============================
396
202 (Accepted)    The request has been accepted
397
================  ===============================
398

    
399

    
400
DELETE
401
""""""
402

    
403
No request parameters/headers.
404

    
405
No reply content/headers.
406

    
407
================  ===============================
408
Return Code       Description
409
================  ===============================
410
204 (No Content)  The request succeeded
411
409 (Conflict)    The container is not empty
412
================  ===============================
413

    
414

    
415
Object Level
416
^^^^^^^^^^^^
417

    
418
List of operations:
419

    
420
=========  =================================
421
Operation  Description
422
=========  =================================
423
HEAD       Retrieve object metadata
424
GET        Read object data
425
PUT        Write object data or copy/move object
426
COPY       Copy object
427
MOVE       Move object
428
POST       Update object metadata/data
429
DELETE     Delete object
430
=========  =================================
431

    
432

    
433
HEAD
434
""""
435

    
436
======================  ===================================
437
Request Parameter Name  Value
438
======================  ===================================
439
version                 Optional version identifier
440
======================  ===================================
441

    
442
|
443

    
444
==========================  ===============================
445
Reply Header Name           Value
446
==========================  ===============================
447
ETag                        The ETag of the object
448
Content-Length              The size of the object
449
Content-Type                The MIME content type of the object
450
Last-Modified               The last object modification date (regardless of version)
451
Content-Encoding            The encoding of the object (optional)
452
Content-Disposition         The presentation style of the object (optional)
453
X-Object-Version            The object's version identifier
454
X-Object-Version-Timestamp  The object's version timestamp
455
X-Object-Modified-By        The user that comitted the object's version
456
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
457
X-Object-Sharing            Object permissions (optional)
458
X-Object-Shared-By          Object inheriting permissions (optional)
459
X-Object-Public             Object's publicly accessible URI (optional)
460
X-Object-Meta-*             Optional user defined metadata
461
==========================  ===============================
462

    
463
|
464

    
465
================  ===============================
466
Return Code       Description
467
================  ===============================
468
200 (No Content)  The request succeeded
469
================  ===============================
470

    
471

    
472
GET
473
"""
474

    
475
====================  ================================
476
Request Header Name   Value
477
====================  ================================
478
Range                 Optional range of data to retrieve
479
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
480
If-Match              Retrieve if ETags match
481
If-None-Match         Retrieve if ETags don't match
482
If-Modified-Since     Retrieve if object has changed since provided timestamp
483
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
484
====================  ================================
485

    
486
|
487

    
488
======================  ===================================
489
Request Parameter Name  Value
490
======================  ===================================
491
format                  Optional extended reply type (can be ``json`` or ``xml``)
492
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
493
======================  ===================================
494

    
495
The reply is the object's data (or part of it), except if a hashmap is requested with the ``format`` parameter, or a version list with ``version=list`` (in which case an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
496

    
497
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block.
498

    
499
Example ``format=json`` reply:
500

    
501
::
502

    
503
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
504

    
505
Example ``format=xml`` reply:
506

    
507
::
508

    
509
  <?xml version="1.0" encoding="UTF-8"?>
510
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
511
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
512
    <hash>...</hash>
513
  </object>
514

    
515
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
516

    
517
Example ``format=json`` reply:
518

    
519
::
520

    
521
  {"versions": [[23, 1307700892], [28, 1307700898], ...]}
522

    
523
Example ``format=xml`` reply:
524

    
525
::
526

    
527
  <?xml version="1.0" encoding="UTF-8"?>
528
  <object name="file">
529
    <version timestamp="1307700892">23</version>
530
    <version timestamp="1307700898">28</version>
531
    <version timestamp="...">...</version>
532
  </object>
533

    
534
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
535

    
536
==========================  ===============================
537
Reply Header Name           Value
538
==========================  ===============================
539
ETag                        The ETag of the object
540
Content-Length              The size of the data returned
541
Content-Type                The MIME content type of the object
542
Content-Range               The range of data included (only on a single range request)
543
Last-Modified               The last object modification date (regardless of version)
544
Content-Encoding            The encoding of the object (optional)
545
Content-Disposition         The presentation style of the object (optional)
546
X-Object-Version            The object's version identifier
547
X-Object-Version-Timestamp  The object's version timestamp
548
X-Object-Modified-By        The user that comitted the object's version
549
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
550
X-Object-Sharing            Object permissions (optional)
551
X-Object-Shared-By          Object inheriting permissions (optional)
552
X-Object-Public             Object's publicly accessible URI (optional)
553
X-Object-Meta-*             Optional user defined metadata
554
==========================  ===============================
555

    
556
|
557

    
558
===========================  ==============================
559
Return Code                  Description
560
===========================  ==============================
561
200 (OK)                     The request succeeded
562
206 (Partial Content)        The range request succeeded
563
304 (Not Modified)           The object has not been modified
564
412 (Precondition Failed)    The condition set can not be satisfied
565
416 (Range Not Satisfiable)  The requested range is out of limits
566
===========================  ==============================
567

    
568

    
569
PUT
570
"""
571

    
572
====================  ================================
573
Request Header Name   Value
574
====================  ================================
575
ETag                  The MD5 hash of the object (optional to check written data)
576
Content-Length        The size of the data written
577
Content-Type          The MIME content type of the object
578
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
579
X-Copy-From           The source path in the form ``/<container>/<object>``
580
X-Move-From           The source path in the form ``/<container>/<object>``
581
X-Source-Version      The source version to copy from
582
Content-Encoding      The encoding of the object (optional)
583
Content-Disposition   The presentation style of the object (optional)
584
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
585
X-Object-Sharing      Object permissions (optional)
586
X-Object-Public       Object is publicly accessible (optional)
587
X-Object-Meta-*       Optional user defined metadata
588
====================  ================================
589

    
590
|
591

    
592
======================  ===================================
593
Request Parameter Name  Value
594
======================  ===================================
595
format                  Optional extended request type (can be ``json``) to create the object by suppling its hashmap instead
596
======================  ===================================
597

    
598
The request is the object's data (or part of it), except if a hashmap is provided with the ``format`` parameter.  If format is used and all different parts are stored in the server, the object is created, otherwise the server returns Conflict (409) with the list of the missing parts. 
599

    
600
Hashmaps expose the underlying storage format of the object.
601

    
602
Example ``format=json`` request:
603

    
604
::
605

    
606
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
607

    
608

    
609
==========================  ===============================
610
Reply Header Name           Value
611
==========================  ===============================
612
ETag                        The MD5 hash of the object (on create)
613
==========================  ===============================
614

    
615
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
616

    
617
===========================  ==============================
618
Return Code                  Description
619
===========================  ==============================
620
201 (Created)                The object has been created
621
409 (Conflict)               The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a conflicting sharing path will be included in the reply - in JSON format)
622
411 (Length Required)        Missing ``Content-Length`` or ``Content-Type`` in the request
623
422 (Unprocessable Entity)   The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
624
===========================  ==============================
625

    
626

    
627
COPY
628
""""
629

    
630
====================  ================================
631
Request Header Name   Value
632
====================  ================================
633
Destination           The destination path in the form ``/<container>/<object>``
634
Content-Type          The MIME content type of the object (optional)
635
Content-Encoding      The encoding of the object (optional)
636
Content-Disposition   The presentation style of the object (optional)
637
X-Source-Version      The source version to copy from
638
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
639
X-Object-Sharing      Object permissions (optional)
640
X-Object-Public       Object is publicly accessible (optional)
641
X-Object-Meta-*       Optional user defined metadata
642
====================  ================================
643

    
644
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
645

    
646
No reply content/headers.
647

    
648
===========================  ==============================
649
Return Code                  Description
650
===========================  ==============================
651
201 (Created)                The object has been created
652
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
653
===========================  ==============================
654

    
655

    
656
MOVE
657
""""
658

    
659
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
660

    
661

    
662
POST
663
""""
664

    
665
======================  ============================================
666
Request Parameter Name  Value
667
======================  ============================================
668
update                  Do not replace metadata (no value parameter)
669
======================  ============================================
670

    
671
|
672

    
673
====================  ================================
674
Request Header Name   Value
675
====================  ================================
676
Content-Length        The size of the data written (optional, to update)
677
Content-Type          The MIME content type of the object (optional, to update)
678
Content-Range         The range of data supplied (optional, to update)
679
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
680
Content-Encoding      The encoding of the object (optional)
681
Content-Disposition   The presentation style of the object (optional)
682
X-Source-Object       Update with data from the object at path ``/<container>/<object>`` (optional, to update)
683
X-Source-Version      The source version to update from (optional, to update)
684
X-Object-Bytes        The updated object's final size (optional, when updating)
685
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
686
X-Object-Sharing      Object permissions (optional)
687
X-Object-Public       Object is publicly accessible (optional)
688
X-Object-Meta-*       Optional user defined metadata
689
====================  ================================
690

    
691
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
692

    
693
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
694

    
695
To update an object's data:
696

    
697
* Either set ``Content-Type`` to ``application/octet-stream``, or provide an object with ``X-Source-Object``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
698
* If the data is supplied in the request (using ``Content-Type`` instead of ``X-Source-Object``), a valid ``Content-Length`` header is required - except if using chunked transfers (set ``Transfer-Encoding`` to ``chunked``).
699
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
700

    
701
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
702
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
703
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data will be appended to the object.
704

    
705
Optionally, truncate the updated object to the desired length with the ``X-Object-Bytes`` header.
706

    
707
A data update will trigger an ETag change. The new ETag will not correspond to the object's MD5 sum (**TBD**) and will be included in reply headers.
708

    
709
No reply content. No reply headers if only metadata is updated.
710

    
711
==========================  ===============================
712
Reply Header Name           Value
713
==========================  ===============================
714
ETag                        The new ETag of the object (data updated)
715
==========================  ===============================
716

    
717
|
718

    
719
===========================  ==============================
720
Return Code                  Description
721
===========================  ==============================
722
202 (Accepted)               The request has been accepted (not a data update)
723
204 (No Content)             The request succeeded (data updated)
724
409 (Conflict)               There are conflicting permissions (a conflicting sharing path will be included in the reply - in JSON format)
725
411 (Length Required)        Missing ``Content-Length`` in the request
726
416 (Range Not Satisfiable)  The supplied range is invalid
727
===========================  ==============================
728

    
729
The ``POST`` method can also be used for creating an object via a standard HTML form. If the request ``Content-Type`` is ``multipart/form-data``, none of the above headers will be processed. The form should have exactly two fields, as in the following example. ::
730

    
731
  <form method="post" action="https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt" enctype="multipart/form-data">
732
    <input type="hidden" name="X-Auth-Token" value="0000">
733
    <input type="file" name="X-Object-Data">
734
    <input type="submit">
735
  </form>
736

    
737
This will create/override the object with the given name, as if using ``PUT``. The ``Content-Type`` of the object will be set to the value of the corresponding header sent in the part of the request containing the data. Metadata, sharing and other object attributes can not be set this way.
738

    
739
==========================  ===============================
740
Reply Header Name           Value
741
==========================  ===============================
742
ETag                        The MD5 hash of the object
743
==========================  ===============================
744

    
745
|
746

    
747
===========================  ==============================
748
Return Code                  Description
749
===========================  ==============================
750
201 (Created)                The object has been created
751
===========================  ==============================
752

    
753

    
754
DELETE
755
""""""
756

    
757
No request parameters/headers.
758

    
759
No reply content/headers.
760

    
761
===========================  ==============================
762
Return Code                  Description
763
===========================  ==============================
764
204 (No Content)             The request succeeded
765
===========================  ==============================
766

    
767
Sharing and Public Objects
768
^^^^^^^^^^^^^^^^^^^^^^^^^^
769

    
770
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
771

    
772
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden).
773

    
774
==========================  ===============================
775
Reply Header Name           Value
776
==========================  ===============================
777
ETag                        The ETag of the object
778
Content-Length              The size of the data returned
779
Content-Type                The MIME content type of the object
780
Content-Range               The range of data included (only on a single range request)
781
Last-Modified               The last object modification date (regardless of version)
782
Content-Encoding            The encoding of the object (optional)
783
Content-Disposition         The presentation style of the object (optional)
784
==========================  ===============================
785

    
786
Summary
787
^^^^^^^
788

    
789
List of differences from the OOS API:
790

    
791
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
792
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
793
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings.
794
* Container policies to manage behavior and limits.
795
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
796
* All metadata replies, at all levels, include latest modification information.
797
* At all levels, a ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
798
* Container/object lists include all associated metadata if the reply is of type json/xml. Some names are kept to their OOS API equivalents for compatibility. 
799
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
800
* Multi-range object ``GET`` support as outlined in RFC2616.
801
* Object hashmap retrieval through ``GET`` and the ``format`` parameter.
802
* Object create via hashmap through ``PUT`` and the ``format`` parameter.
803
* Object create using ``POST`` to support standard HTML forms.
804
* Partial object updates through ``POST``, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers. Use another object's data to update with ``X-Source-Object`` and ``X-Source-Version``. Truncate with ``X-Object-Bytes``.
805
* Object ``MOVE`` support.
806
* Time-variant account/container listings via the ``until`` parameter.
807
* Object versions - parameter ``version`` in ``HEAD``/``GET`` (list versions with ``GET``), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in ``PUT``/``COPY``.
808
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
809
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
810
* Large object support with ``X-Object-Manifest``.
811
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
812

    
813
Clarifications/suggestions:
814

    
815
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
816
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
817
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
818
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
819
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired reply format. The parameter overrides the header.
820
* Container/object lists use a ``200`` return code if the reply is of type json/xml. The reply will include an empty json/xml.
821
* In headers, dates are formatted according to RFC 1123. In extended information listings, dates are formatted according to ISO 8601.
822
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
823
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
824
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
825

    
826
The Pithos Client
827
-----------------
828

    
829
User Experience
830
^^^^^^^^^^^^^^^
831

    
832
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
833

    
834
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
835

    
836
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
837
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
838
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
839
* The ``others`` element, which contains all objects that other users share with the user.
840
* The ``tags`` element, which lists the names of tags the user has defined. This can be an entry point to list all files that have been assigned a specific tag or manage tags in general (remove a tag completely, rename a tag etc.).
841
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
842

    
843
Objects in Pithos can be:
844

    
845
* Assigned custom tags.
846
* Moved to trash and then deleted.
847
* Shared with specific permissions.
848
* Made public (shared with non-Pithos users).
849
* Restored from previous versions.
850

    
851
Some of these functions are performed by the client software and some by the Pithos server. Client-driven functionality is based on specific metadata that should be handled equally across implementations. These metadata names are discussed in the next chapter. 
852

    
853
Conventions and Metadata Specification
854
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
855

    
856
Pithos clients should use the ``pithos`` container for all Pithos objects. Object names use the ``/`` delimiter to impose a hierarchy of folders and files.
857

    
858
At the object level, tags are implemented by managing metadata keys. The client software should allow the user to use any string as a tag and then set the corresponding ``X-Object-Meta-<tag>`` key at the server. The API extensions provided, allow for listing all tags in a container and filtering object listings based on one or more tags. The tag list is sufficient for implementing the ``tags`` element, either as a special, virtual folder (as done in the first version of Pithos), or as an application menu.
859

    
860
The metadata specification is summarized in the following table.
861

    
862
===========================  ==============================
863
Metadata Name                Value
864
===========================  ==============================
865
X-Object-Meta-*              Use for other tags that apply to the object
866
===========================  ==============================
867

    
868
Recommended Practices and Examples
869
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
870

    
871
Assuming an authentication token is obtained (**TBD**), the following high-level operations are available - shown with ``curl``:
872

    
873
* Get account information ::
874

    
875
    curl -X HEAD -D - \
876
         -H "X-Auth-Token: 0000" \
877
         https://pithos.dev.grnet.gr/v1/user
878

    
879
* List available containers ::
880

    
881
    curl -X GET -D - \
882
         -H "X-Auth-Token: 0000" \
883
         https://pithos.dev.grnet.gr/v1/user
884

    
885
* Get container information ::
886

    
887
    curl -X HEAD -D - \
888
         -H "X-Auth-Token: 0000" \
889
         https://pithos.dev.grnet.gr/v1/user/pithos
890

    
891
* Add a new container ::
892

    
893
    curl -X PUT -D - \
894
         -H "X-Auth-Token: 0000" \
895
         https://pithos.dev.grnet.gr/v1/user/test
896

    
897
* Delete a container ::
898

    
899
    curl -X DELETE -D - \
900
         -H "X-Auth-Token: 0000" \
901
         https://pithos.dev.grnet.gr/v1/user/test
902

    
903
* List objects in a container ::
904

    
905
    curl -X GET -D - \
906
         -H "X-Auth-Token: 0000" \
907
         https://pithos.dev.grnet.gr/v1/user/pithos
908

    
909
* List objects in a container (extended reply) ::
910

    
911
    curl -X GET -D - \
912
         -H "X-Auth-Token: 0000" \
913
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
914

    
915
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
916

    
917
* List metadata keys used by objects in a container
918

    
919
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``).
920

    
921
* List objects in a container having a specific meta defined ::
922

    
923
    curl -X GET -D - \
924
         -H "X-Auth-Token: 0000" \
925
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=favorites
926

    
927
* Retrieve an object ::
928

    
929
    curl -X GET -D - \
930
         -H "X-Auth-Token: 0000" \
931
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
932

    
933
* Retrieve an object (specific ranges of data) ::
934

    
935
    curl -X GET -D - \
936
         -H "X-Auth-Token: 0000" \
937
         -H "Range: bytes=0-9" \
938
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
939

    
940
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
941

    
942
* Add a new object (folder type) (**TBD**) ::
943

    
944
    curl -X PUT -D - \
945
         -H "X-Auth-Token: 0000" \
946
         -H "Content-Type: application/folder" \
947
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
948

    
949
* Add a new object ::
950

    
951
    curl -X PUT -D - \
952
         -H "X-Auth-Token: 0000" \
953
         -H "Content-Type: text/plain" \
954
         -T EXAMPLE.txt
955
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
956

    
957
* Update an object ::
958

    
959
    curl -X POST -D - \
960
         -H "X-Auth-Token: 0000" \
961
         -H "Content-Length: 10" \
962
         -H "Content-Type: application/octet-stream" \
963
         -H "Content-Range: bytes 10-19/*" \
964
         -d "0123456789" \
965
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
966

    
967
  This will update bytes 10-19 with the data specified.
968

    
969
* Update an object (append) ::
970

    
971
    curl -X POST -D - \
972
         -H "X-Auth-Token: 0000" \
973
         -H "Content-Length: 10" \
974
         -H "Content-Type: application/octet-stream" \
975
         -H "Content-Range: bytes */*" \
976
         -d "0123456789" \
977
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
978

    
979
* Update an object (truncate) ::
980

    
981
    curl -X POST -D - \
982
         -H "X-Auth-Token: 0000" \
983
         -H "X-Source-Object: /folder/EXAMPLE.txt" \
984
         -H "Content-Range: bytes 0-0/*" \
985
         -H "X-Object-Bytes: 0" \
986
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
987

    
988
  This will truncate the object to 0 bytes.
989

    
990
* Add object metadata ::
991

    
992
    curl -X POST -D - \
993
         -H "X-Auth-Token: 0000" \
994
         -H "X-Object-Meta-First: first_meta_value" \
995
         -H "X-Object-Meta-Second: second_meta_value" \
996
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
997

    
998
* Delete object metadata ::
999

    
1000
    curl -X POST -D - \
1001
         -H "X-Auth-Token: 0000" \
1002
         -H "X-Object-Meta-First: first_meta_value" \
1003
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1004

    
1005
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
1006

    
1007
* Delete an object ::
1008

    
1009
    curl -X DELETE -D - \
1010
         -H "X-Auth-Token: 0000" \
1011
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt