Statistics
| Branch: | Tag: | Revision:

root / docs / source / devguide.rst @ af7bb62f

History | View | Annotate | Download (70.3 kB)

1
Pithos v2 Developer Guide
2
=========================
3

    
4
Introduction
5
------------
6

    
7
Pithos is a storage service implemented by GRNET (http://www.grnet.gr). Data is stored as objects, organized in containers, belonging to an account. This hierarchy of storage layers has been inspired by the OpenStack Object Storage (OOS) API and similar CloudFiles API by Rackspace. The Pithos API follows the OOS API as closely as possible. One of the design requirements has been to be able to use Pithos with clients built for the OOS, without changes.
8

    
9
However, to be able to take full advantage of the Pithos infrastructure, client software should be aware of the extensions that differentiate Pithos from OOS. Pithos objects can be updated, or appended to. Pithos will store sharing permissions per object and enforce corresponding authorization policies. Automatic version management, allows taking account and container listings back in time, as well as reading previous instances of objects.
10

    
11
The storage backend of Pithos is block oriented, permitting efficient, deduplicated data placement. The block structure of objects is exposed at the API layer, in order to encourage external software to implement advanced data management operations.
12

    
13
This document's goals are:
14

    
15
* Define the Pithos ReST API that allows the storage and retrieval of data and metadata via HTTP calls
16
* Specify metadata semantics and user interface guidelines for a common experience across client software implementations
17

    
18
The present document is meant to be read alongside the OOS API documentation. Thus, it is suggested that the reader is familiar with associated technologies, the OOS API as well as the first version of the Pithos API. This document refers to the second version of Pithos. Information on the first version of the storage API can be found at http://code.google.com/p/gss.
19

    
20
Whatever marked as to be determined (**TBD**), should not be considered by implementors.
21

    
22
More info about Pithos can be found here: https://code.grnet.gr/projects/pithos
23

    
24
Document Revisions
25
^^^^^^^^^^^^^^^^^^
26

    
27
=========================  ================================
28
Revision                   Description
29
=========================  ================================
30
0.8 (Jan 24, 2012)         Update allowed versioning values.
31
\                          Change policy/meta formatting in JSON/XML replies.
32
\                          Document that all non-ASCII characters in headers should be URL-encoded.
33
\                          Support metadata-based queries when listing objects at the container level.
34
\                          Note Content-Type issue when using the internal django web server.
35
\                          Add object UUID field.
36
\                          Always reply with the MD5 in the ETag.
37
\                          Note that ``/login`` will only work if an external authentication system is defined.
38
\                          Include option to ignore Content-Type on ``COPY``/``MOVE``.
39
\                          Use format parameter for conflict (409) and uploaded hash list (container level) replies.
40
0.7 (Nov 21, 2011)         Suggest upload/download methods using hashmaps.
41
\                          Propose syncing algorithm.
42
\                          Support cross-account object copy and move.
43
\                          Pass token as a request parameter when using ``POST`` via an HTML form.
44
\                          Optionally use source account to update object from another object.
45
\                          Use container ``POST`` to upload missing blocks of data.
46
\                          Report policy in account headers.
47
\                          Add insufficient quota reply.
48
\                          Use special meta to always report Merkle hash.
49
0.6 (Sept 13, 2011)        Reply with Merkle hash as the ETag when updating objects.
50
\                          Include version id in object replace/change replies.
51
\                          Change conflict (409) replies format to text.
52
\                          Tags should be migrated to a meta value.
53
\                          Container ``PUT`` updates metadata/policy.
54
\                          Report allowed actions in shared object replies.
55
\                          Provide ``https://hostname/login`` for Shibboleth authentication.
56
\                          Use ``hashmap`` parameter in object ``GET``/``PUT`` to use hashmaps.
57
0.5 (July 22, 2011)        Object update from another object's data.
58
\                          Support object truncate.
59
\                          Create object using a standard HTML form.
60
\                          Purge container/object history.
61
\                          List other accounts that share objects with a user.
62
\                          List shared containers/objects.
63
\                          Update implementation guidelines.
64
\                          Check preconditions when creating/updating objects.
65
0.4 (July 01, 2011)        Object permissions and account groups.
66
\                          Control versioning behavior and container quotas with container policy directives.
67
\                          Support updating/deleting individual metadata with ``POST``.
68
\                          Create object using hashmap.
69
0.3 (June 14, 2011)        Large object support with ``X-Object-Manifest``.
70
\                          Allow for publicly available objects via ``https://hostname/public``.
71
\                          Support time-variant account/container listings. 
72
\                          Add source version when duplicating with ``PUT``/``COPY``.
73
\                          Request version in object ``HEAD``/``GET`` requests (list versions with ``GET``).
74
0.2 (May 31, 2011)         Add object meta listing and filtering in containers.
75
\                          Include underlying storage characteristics in container meta.
76
\                          Support for partial object updates through ``POST``.
77
\                          Expose object hashmaps through ``GET``.
78
\                          Support for multi-range object ``GET`` requests.
79
0.1 (May 17, 2011)         Initial release. Based on OpenStack Object Storage Developer Guide API v1 (Apr. 15, 2011).
80
=========================  ================================
81

    
82
Pithos Users and Authentication
83
-------------------------------
84

    
85
In Pithos, each user is uniquely identified by a token. All API requests require a token and each token is internally resolved to an account string. The API uses the account string to identify the user's own files, thus whether a request is local or cross-account.
86

    
87
Pithos does not keep a user database. For development and testing purposes, user identifiers and their corresponding tokens can be defined in the settings file. However, Pithos is designed with an external authentication service in mind. This service must handle the details of validating user credentials and communicate with Pithos via a middleware software component that, given a token, fills in the internal request account variable.
88

    
89
Client software using Pithos, if not already knowing a user's identifier and token, should forward to the ``/login`` URI. The Pithos server, depending on its configuration will redirect to the appropriate login page.
90

    
91
The login URI accepts the following parameters:
92

    
93
======================  =========================
94
Request Parameter Name  Value
95
======================  =========================
96
next                    The URI to redirect to when the process is finished
97
renew                   Force token renewal (no value parameter)
98
======================  =========================
99

    
100
When done with logging in, the service's login URI should redirect to the URI provided with ``next``, adding ``user`` and ``token`` parameters, which contain the account and token fields respectively.
101

    
102
A user management service that implements a login URI according to these conventions is Astakos (https://code.grnet.gr/projects/astakos), by GRNET.
103

    
104
The Pithos API
105
--------------
106

    
107
The URI requests supported by the Pithos API follow one of the following forms:
108

    
109
* Top level: ``https://hostname/v1/``
110
* Account level: ``https://hostname/v1/<account>``
111
* Container level: ``https://hostname/v1/<account>/<container>``
112
* Object level: ``https://hostname/v1/<account>/<container>/<object>``
113

    
114
All requests must include an ``X-Auth-Token`` - as a header, or a parameter.
115

    
116
The allowable request operations and respective return codes per level are presented in the remainder of this chapter. Common to all requests are the following return codes.
117

    
118
=========================  ================================
119
Return Code                Description
120
=========================  ================================
121
400 (Bad Request)          The request is invalid
122
401 (Unauthorized)         Missing or invalid token
123
403 (Forbidden)            Request not allowed
124
404 (Not Found)            The requested resource was not found
125
503 (Service Unavailable)  The request cannot be completed because of an internal error
126
=========================  ================================
127

    
128
Top Level
129
^^^^^^^^^
130

    
131
List of operations:
132

    
133
=========  ==================
134
Operation  Description
135
=========  ==================
136
GET        Authentication (for compatibility with the OOS API) or list allowed accounts
137
=========  ==================
138

    
139
GET
140
"""
141

    
142
If the ``X-Auth-User`` and ``X-Auth-Key`` headers are given, a dummy ``X-Auth-Token`` and ``X-Storage-Url`` will be replied, which can be used as a guest token/namespace for testing Pithos.
143

    
144
================  =====================
145
Return Code       Description
146
================  =====================
147
204 (No Content)  The request succeeded
148
================  =====================
149

    
150
If an ``X-Auth-Token`` is already present, the operation will be interpreted as a request to list other accounts that share objects to the user.
151

    
152
======================  =========================
153
Request Parameter Name  Value
154
======================  =========================
155
limit                   The amount of results requested (default is 10000)
156
marker                  Return containers with name lexicographically after marker
157
format                  Optional extended reply type (can be ``json`` or ``xml``)
158
======================  =========================
159

    
160
The reply is a list of account names.
161
If a ``format=xml`` or ``format=json`` argument is given, extended information on the accounts will be returned, serialized in the chosen format.
162
For each account, the information will include the following (names will be in lower case and with hyphens replaced with underscores):
163

    
164
===========================  ============================
165
Name                         Description
166
===========================  ============================
167
name                         The name of the account
168
last_modified                The last account modification date (regardless of ``until``)
169
===========================  ============================
170

    
171
Example ``format=json`` reply:
172

    
173
::
174

    
175
  [{"name": "user", "last_modified": "2011-12-02T08:10:41.565891+00:00"}, ...]
176

    
177
Example ``format=xml`` reply:
178

    
179
::
180

    
181
  <?xml version="1.0" encoding="UTF-8"?>
182
  <accounts>
183
    <account>
184
      <name>user</name>
185
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
186
    </account>
187
    <account>...</account>
188
  </accounts>
189

    
190
===========================  =====================
191
Return Code                  Description
192
===========================  =====================
193
200 (OK)                     The request succeeded
194
204 (No Content)             The user has no access to other accounts (only for non-extended replies)
195
===========================  =====================
196

    
197
Will use a ``200`` return code if the reply is of type JSON/XML.
198

    
199
Account Level
200
^^^^^^^^^^^^^
201

    
202
List of operations:
203

    
204
=========  ==================
205
Operation  Description
206
=========  ==================
207
HEAD       Retrieve account metadata
208
GET        List containers
209
POST       Update account metadata
210
=========  ==================
211

    
212
HEAD
213
""""
214

    
215
====================  ===========================
216
Request Header Name   Value
217
====================  ===========================
218
If-Modified-Since     Retrieve if account has changed since provided timestamp
219
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
220
====================  ===========================
221

    
222
|
223

    
224
======================  ===================================
225
Request Parameter Name  Value
226
======================  ===================================
227
until                   Optional timestamp
228
======================  ===================================
229

    
230
Cross-user requests are not allowed to use ``until`` and only include the account modification date in the reply.
231

    
232
==========================  =====================
233
Reply Header Name           Value
234
==========================  =====================
235
X-Account-Container-Count   The total number of containers
236
X-Account-Bytes-Used        The total number of bytes stored
237
X-Account-Until-Timestamp   The last account modification date until the timestamp provided
238
X-Account-Group-*           Optional user defined groups
239
X-Account-Policy-*          Account behavior and limits
240
X-Account-Meta-*            Optional user defined metadata
241
Last-Modified               The last account modification date (regardless of ``until``)
242
==========================  =====================
243

    
244
|
245

    
246
================  =====================
247
Return Code       Description
248
================  =====================
249
204 (No Content)  The request succeeded
250
================  =====================
251

    
252

    
253
GET
254
"""
255

    
256
====================  ===========================
257
Request Header Name   Value
258
====================  ===========================
259
If-Modified-Since     Retrieve if account has changed since provided timestamp
260
If-Unmodified-Since   Retrieve if account has not changed since provided timestamp
261
====================  ===========================
262

    
263
|
264

    
265
======================  =========================
266
Request Parameter Name  Value
267
======================  =========================
268
limit                   The amount of results requested (default is 10000)
269
marker                  Return containers with name lexicographically after marker
270
format                  Optional extended reply type (can be ``json`` or ``xml``)
271
shared                  Show only shared containers (no value parameter)
272
until                   Optional timestamp
273
======================  =========================
274

    
275
The reply is a list of container names. Account headers (as in a ``HEAD`` request) will also be included.
276
Cross-user requests are not allowed to use ``until`` and only include the account/container modification dates in the reply.
277

    
278
If a ``format=xml`` or ``format=json`` argument is given, extended information on the containers will be returned, serialized in the chosen format.
279
For each container, the information will include all container metadata (names will be in lower case and with hyphens replaced with underscores):
280

    
281
===========================  ============================
282
Name                         Description
283
===========================  ============================
284
name                         The name of the container
285
count                        The number of objects inside the container
286
bytes                        The total size of the objects inside the container
287
last_modified                The last container modification date (regardless of ``until``)
288
x_container_until_timestamp  The last container modification date until the timestamp provided
289
x_container_policy_*         Container behavior and limits
290
x_container_meta_*           Optional user defined metadata
291
===========================  ============================
292

    
293
Example ``format=json`` reply:
294

    
295
::
296

    
297
  [{"name": "pithos",
298
    "bytes": 62452,
299
    "count": 8374,
300
    "last_modified": "2011-12-02T08:10:41.565891+00:00",
301
    "x_container_policy": {"quota": "53687091200", "versioning": "auto"},
302
    "x_container_meta": {"a": "b", "1": "2"}}, ...]
303

    
304
Example ``format=xml`` reply:
305

    
306
::
307

    
308
  <?xml version="1.0" encoding="UTF-8"?>
309
  <account name="user">
310
    <container>
311
      <name>pithos</name>
312
      <bytes>62452</bytes>
313
      <count>8374</count>
314
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
315
      <x_container_policy>
316
        <key>quota</key><value>53687091200</value>
317
        <key>versioning</key><value>auto</value>
318
      </x_container_policy>
319
      <x_container_meta>
320
        <key>a</key><value>b</value>
321
        <key>1</key><value>2</value>
322
      </x_container_meta>
323
    </container>
324
    <container>...</container>
325
  </account>
326

    
327
For more examples of container details returned in JSON/XML formats refer to the OOS API documentation. In addition to the OOS API, Pithos returns all fields. Policy and metadata values are grouped and returned as key-value pairs.
328

    
329
===========================  =====================
330
Return Code                  Description
331
===========================  =====================
332
200 (OK)                     The request succeeded
333
204 (No Content)             The account has no containers (only for non-extended replies)
334
304 (Not Modified)           The account has not been modified
335
412 (Precondition Failed)    The condition set can not be satisfied
336
===========================  =====================
337

    
338
Will use a ``200`` return code if the reply is of type JSON/XML.
339

    
340

    
341
POST
342
""""
343

    
344
====================  ===========================
345
Request Header Name   Value
346
====================  ===========================
347
X-Account-Group-*     Optional user defined groups
348
X-Account-Meta-*      Optional user defined metadata
349
====================  ===========================
350

    
351
|
352

    
353
======================  ============================================
354
Request Parameter Name  Value
355
======================  ============================================
356
update                  Do not replace metadata/groups (no value parameter)
357
======================  ============================================
358

    
359
No reply content/headers.
360

    
361
The operation will overwrite all user defined metadata, except if ``update`` is defined.
362
To create a group, include an ``X-Account-Group-*`` header with the name in the key and a comma separated list of user names in the value. If no ``X-Account-Group-*`` header is present, no changes will be applied to groups. The ``update`` parameter also applies to groups. To delete a specific group, use ``update`` and an empty header value.
363

    
364
================  ===============================
365
Return Code       Description
366
================  ===============================
367
202 (Accepted)    The request has been accepted
368
================  ===============================
369

    
370

    
371
Container Level
372
^^^^^^^^^^^^^^^
373

    
374
List of operations:
375

    
376
=========  ============================
377
Operation  Description
378
=========  ============================
379
HEAD       Retrieve container metadata
380
GET        List objects
381
PUT        Create/update container
382
POST       Update container metadata
383
DELETE     Delete container
384
=========  ============================
385

    
386

    
387
HEAD
388
""""
389

    
390
====================  ===========================
391
Request Header Name   Value
392
====================  ===========================
393
If-Modified-Since     Retrieve if container has changed since provided timestamp
394
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
395
====================  ===========================
396

    
397
|
398

    
399
======================  ===================================
400
Request Parameter Name  Value
401
======================  ===================================
402
until                   Optional timestamp
403
======================  ===================================
404

    
405
Cross-user requests are not allowed to use ``until`` and only include the container modification date in the reply.
406

    
407
===========================  ===============================
408
Reply Header Name            Value
409
===========================  ===============================
410
X-Container-Object-Count     The total number of objects in the container
411
X-Container-Bytes-Used       The total number of bytes of all objects stored
412
X-Container-Block-Size       The block size used by the storage backend
413
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
414
X-Container-Until-Timestamp  The last container modification date until the timestamp provided
415
X-Container-Object-Meta      A list with all meta keys used by objects (**TBD**)
416
X-Container-Policy-*         Container behavior and limits
417
X-Container-Meta-*           Optional user defined metadata
418
Last-Modified                The last container modification date (regardless of ``until``)
419
===========================  ===============================
420

    
421
The keys returned in ``X-Container-Object-Meta`` are all the unique strings after the ``X-Object-Meta-`` prefix, formatted as a comma-separated list. See container ``PUT`` for a reference of policy directives. (**TBD**)
422

    
423
================  ===============================
424
Return Code       Description
425
================  ===============================
426
204 (No Content)  The request succeeded
427
================  ===============================
428

    
429

    
430
GET
431
"""
432

    
433
====================  ===========================
434
Request Header Name   Value
435
====================  ===========================
436
If-Modified-Since     Retrieve if container has changed since provided timestamp
437
If-Unmodified-Since   Retrieve if container has not changed since provided timestamp
438
====================  ===========================
439

    
440
|
441

    
442
======================  ===================================
443
Request Parameter Name  Value
444
======================  ===================================
445
limit                   The amount of results requested (default is 10000)
446
marker                  Return containers with name lexicographically after marker
447
prefix                  Return objects starting with prefix
448
delimiter               Return objects up to the delimiter (discussion follows)
449
path                    Assume ``prefix=path`` and ``delimiter=/``
450
format                  Optional extended reply type (can be ``json`` or ``xml``)
451
meta                    Return objects that satisfy the key queries in the specified comma separated list (use ``<key>``, ``!<key>`` for existence queries, ``<key><op><value>`` for value queries, where ``<op>`` can be one of ``=``, ``!=``, ``<=``, ``>=``, ``<``, ``>``)
452
shared                  Show only shared objects (no value parameter)
453
until                   Optional timestamp
454
======================  ===================================
455

    
456
The ``path`` parameter overrides ``prefix`` and ``delimiter``. When using ``path``, results will include objects ending in ``delimiter``.
457

    
458
The keys given with ``meta`` will be matched with the strings after the ``X-Object-Meta-`` prefix.
459

    
460
The reply is a list of object names. Container headers (as in a ``HEAD`` request) will also be included.
461
Cross-user requests are not allowed to use ``until`` and include the following limited set of headers in the reply:
462

    
463
===========================  ===============================
464
Reply Header Name            Value
465
===========================  ===============================
466
X-Container-Block-Size       The block size used by the storage backend
467
X-Container-Block-Hash       The hash algorithm used for block identifiers in object hashmaps
468
X-Container-Object-Meta      A list with all meta keys used by allowed objects (**TBD**)
469
Last-Modified                The last container modification date
470
===========================  ===============================
471

    
472
If a ``format=xml`` or ``format=json`` argument is given, extended information on the objects will be returned, serialized in the chosen format.
473
For each object, the information will include all object metadata (names will be in lower case and with hyphens replaced with underscores):
474

    
475
==========================  ======================================
476
Name                        Description
477
==========================  ======================================
478
name                        The name of the object
479
hash                        The ETag of the object
480
bytes                       The size of the object
481
content_type                The MIME content type of the object
482
content_encoding            The encoding of the object (optional)
483
content-disposition         The presentation style of the object (optional)
484
last_modified               The last object modification date (regardless of version)
485
x_object_hash               The Merkle hash
486
x_object_uuid               The object's UUID
487
x_object_version            The object's version identifier
488
x_object_version_timestamp  The object's version timestamp
489
x_object_modified_by        The user that committed the object's version
490
x_object_manifest           Object parts prefix in ``<container>/<object>`` form (optional)
491
x_object_sharing            Object permissions (optional)
492
x_object_shared_by          Object inheriting permissions (optional)
493
x_object_allowed_to         Allowed actions on object (optional)
494
x_object_public             Object's publicly accessible URI (optional)
495
x_object_meta_*             Optional user defined metadata
496
==========================  ======================================
497

    
498
Sharing metadata will only be returned if there is no ``until`` parameter defined.
499

    
500
Extended replies may also include virtual directory markers in separate sections of the ``json`` or ``xml`` results.
501
Virtual directory markers are only included when ``delimiter`` is explicitly set. They correspond to the substrings up to and including the first occurrence of the delimiter.
502
In JSON results they appear as dictionaries with only a ``subdir`` key. In XML results they appear interleaved with ``<object>`` tags as ``<subdir name="..." />``.
503
In case there is an object with the same name as a virtual directory marker, the object will be returned.
504

    
505
Example ``format=json`` reply:
506

    
507
::
508

    
509
  [{"name": "object",
510
    "bytes": 0,
511
    "hash": "d41d8cd98f00b204e9800998ecf8427e",
512
    "content_type": "application/octet-stream",
513
    "last_modified": "2011-12-02T08:10:41.565891+00:00",
514
    "x_object_meta": {"asdf": "qwerty"},
515
    "x_object_hash": "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855",
516
    "x_object_uuid": "8ed9af1b-c948-4bb6-82b0-48344f5c822c",
517
    "x_object_version": 98,
518
    "x_object_version_timestamp": "1322813441.565891",
519
    "x_object_modified_by": "user"}, ...]
520

    
521
Example ``format=xml`` reply:
522

    
523
::
524

    
525
  <?xml version="1.0" encoding="UTF-8"?>
526
  <container name="pithos">
527
    <object>
528
      <name>object</name>
529
      <bytes>0</bytes>
530
      <hash>d41d8cd98f00b204e9800998ecf8427e</hash>
531
      <content_type>application/octet-stream</content_type>
532
      <last_modified>2011-12-02T08:10:41.565891+00:00</last_modified>
533
      <x_object_meta>
534
        <key>asdf</key><value>qwerty</value>
535
      </x_object_meta>
536
      <x_object_hash>e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855</x_object_hash>
537
      <x_object_uuid>8ed9af1b-c948-4bb6-82b0-48344f5c822c</x_object_uuid>
538
      <x_object_version>98</x_object_version>
539
      <x_object_version_timestamp>1322813441.565891</x_object_version_timestamp>
540
      <x_object_modified_by>chazapis</x_object_modified_by>
541
    </object>
542
    <object>...</object>
543
  </container>
544

    
545
For more examples of container details returned in JSON/XML formats refer to the OOS API documentation. In addition to the OOS API, Pithos returns all fields. Metadata values are grouped and returned as key-value pairs.
546

    
547
===========================  ===============================
548
Return Code                  Description
549
===========================  ===============================
550
200 (OK)                     The request succeeded
551
204 (No Content)             The account has no containers (only for non-extended replies)
552
304 (Not Modified)           The container has not been modified
553
412 (Precondition Failed)    The condition set can not be satisfied
554
===========================  ===============================
555

    
556
Will use a ``200`` return code if the reply is of type JSON/XML.
557

    
558

    
559
PUT
560
"""
561

    
562
====================  ================================
563
Request Header Name   Value
564
====================  ================================
565
X-Container-Policy-*  Container behavior and limits
566
X-Container-Meta-*    Optional user defined metadata
567
====================  ================================
568
 
569
No reply content/headers.
570

    
571
If no policy is defined, the container will be created with the default values.
572
Available policy directives:
573

    
574
* ``versioning``: Set to ``auto`` or ``none`` (default is ``auto``)
575
* ``quota``: Size limit in KB (default is ``0`` - unlimited)
576

    
577
If the container already exists, the operation is equal to a ``POST`` with ``update`` defined.
578

    
579
================  ===============================
580
Return Code       Description
581
================  ===============================
582
201 (Created)     The container has been created
583
202 (Accepted)    The request has been accepted
584
================  ===============================
585

    
586

    
587
POST
588
""""
589

    
590
====================  ================================
591
Request Header Name   Value
592
====================  ================================
593
Content-Length        The size of the supplied data (optional, to upload)
594
Content-Type          The MIME content type of the supplied data (optional, to upload)
595
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
596
X-Container-Policy-*  Container behavior and limits
597
X-Container-Meta-*    Optional user defined metadata
598
====================  ================================
599

    
600
|
601

    
602
======================  ============================================
603
Request Parameter Name  Value
604
======================  ============================================
605
format                  Optional hash list reply type (can be ``json`` or ``xml``)
606
update                  Do not replace metadata/policy (no value parameter)
607
======================  ============================================
608

    
609
No reply content/headers, except when uploading data, where the reply consists of a list of hashes for the blocks received (in the format specified).
610

    
611
The operation will overwrite all user defined metadata, except if ``update`` is defined.
612
To change policy, include an ``X-Container-Policy-*`` header with the name in the key. If no ``X-Container-Policy-*`` header is present, no changes will be applied to policy. The ``update`` parameter also applies to policy - deleted values will revert to defaults. To delete/revert a specific policy directive, use ``update`` and an empty header value. See container ``PUT`` for a reference of policy directives.
613

    
614
To upload blocks of data to the container, set ``Content-Type`` to ``application/octet-stream`` and ``Content-Length`` to a valid value (except if using ``chunked`` as the ``Transfer-Encoding``).
615

    
616
================  ===============================
617
Return Code       Description
618
================  ===============================
619
202 (Accepted)    The request has been accepted
620
================  ===============================
621

    
622

    
623
DELETE
624
""""""
625

    
626
======================  ===================================
627
Request Parameter Name  Value
628
======================  ===================================
629
until                   Optional timestamp
630
======================  ===================================
631

    
632
If ``until`` is defined, the container is "purged" up to that time (the history of all objects up to then is deleted).
633

    
634
No reply content/headers.
635

    
636
================  ===============================
637
Return Code       Description
638
================  ===============================
639
204 (No Content)  The request succeeded
640
409 (Conflict)    The container is not empty
641
================  ===============================
642

    
643

    
644
Object Level
645
^^^^^^^^^^^^
646

    
647
List of operations:
648

    
649
=========  =================================
650
Operation  Description
651
=========  =================================
652
HEAD       Retrieve object metadata
653
GET        Read object data
654
PUT        Write object data or copy/move object
655
COPY       Copy object
656
MOVE       Move object
657
POST       Update object metadata/data
658
DELETE     Delete object
659
=========  =================================
660

    
661

    
662
HEAD
663
""""
664

    
665
====================  ================================
666
Request Header Name   Value
667
====================  ================================
668
If-Match              Retrieve if ETags match
669
If-None-Match         Retrieve if ETags don't match
670
If-Modified-Since     Retrieve if object has changed since provided timestamp
671
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
672
====================  ================================
673

    
674
|
675

    
676
======================  ===================================
677
Request Parameter Name  Value
678
======================  ===================================
679
version                 Optional version identifier
680
======================  ===================================
681

    
682
|
683

    
684
==========================  ===============================
685
Reply Header Name           Value
686
==========================  ===============================
687
ETag                        The ETag of the object
688
Content-Length              The size of the object
689
Content-Type                The MIME content type of the object
690
Last-Modified               The last object modification date (regardless of version)
691
Content-Encoding            The encoding of the object (optional)
692
Content-Disposition         The presentation style of the object (optional)
693
X-Object-Hash               The Merkle hash
694
X-Object-UUID               The object's UUID
695
X-Object-Version            The object's version identifier
696
X-Object-Version-Timestamp  The object's version timestamp
697
X-Object-Modified-By        The user that comitted the object's version
698
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
699
X-Object-Sharing            Object permissions (optional)
700
X-Object-Shared-By          Object inheriting permissions (optional)
701
X-Object-Allowed-To         Allowed actions on object (optional)
702
X-Object-Public             Object's publicly accessible URI (optional)
703
X-Object-Meta-*             Optional user defined metadata
704
==========================  ===============================
705

    
706
|
707

    
708
================  ===============================
709
Return Code       Description
710
================  ===============================
711
200 (No Content)  The request succeeded
712
================  ===============================
713

    
714

    
715
GET
716
"""
717

    
718
====================  ================================
719
Request Header Name   Value
720
====================  ================================
721
Range                 Optional range of data to retrieve
722
If-Range              Retrieve the missing part if entity is unchanged; otherwise, retrieve the entire new entity (used together with Range header)
723
If-Match              Retrieve if ETags match
724
If-None-Match         Retrieve if ETags don't match
725
If-Modified-Since     Retrieve if object has changed since provided timestamp
726
If-Unmodified-Since   Retrieve if object has not changed since provided timestamp
727
====================  ================================
728

    
729
|
730

    
731
======================  ===================================
732
Request Parameter Name  Value
733
======================  ===================================
734
format                  Optional extended reply type (can be ``json`` or ``xml``)
735
hashmap                 Optional request for hashmap (no value parameter)
736
version                 Optional version identifier or ``list`` (specify a format if requesting a list)
737
======================  ===================================
738

    
739
The reply is the object's data (or part of it), except if a hashmap is requested with ``hashmap``, or a version list with ``version=list`` (in both cases an extended reply format must be specified). Object headers (as in a ``HEAD`` request) are always included.
740

    
741
Hashmaps expose the underlying storage format of the object. Note that each hash is computed after trimming trailing null bytes of the corresponding block. The ``X-Object-Hash`` header reports the single Merkle hash of the object's hashmap (refer to http://bittorrent.org/beps/bep_0030.html for more information).
742

    
743
Example ``format=json`` reply:
744

    
745
::
746

    
747
  {"block_hash": "sha1", "hashes": ["7295c41da03d7f916440b98e32c4a2a39351546c", ...], "block_size": 131072, "bytes": 242}
748

    
749
Example ``format=xml`` reply:
750

    
751
::
752

    
753
  <?xml version="1.0" encoding="UTF-8"?>
754
  <object name="file" bytes="24223726" block_size="131072" block_hash="sha1">
755
    <hash>7295c41da03d7f916440b98e32c4a2a39351546c</hash>
756
    <hash>...</hash>
757
  </object>
758

    
759
Version lists include the version identifier and timestamp for each available object version. Version identifiers can be arbitrary strings, so use the timestamp to find newer versions.
760

    
761
Example ``format=json`` reply:
762

    
763
::
764

    
765
  {"versions": [[85, "1322734861.248469"], [86, "1322734905.009272"], ...]}
766

    
767
Example ``format=xml`` reply:
768

    
769
::
770

    
771
  <?xml version="1.0" encoding="UTF-8"?>
772
  <object name="file">
773
    <version timestamp="1322734861.248469">85</version>
774
    <version timestamp="1322734905.009272">86</version>
775
    <version timestamp="...">...</version>
776
  </object>
777

    
778
The ``Range`` header may include multiple ranges, as outlined in RFC2616. Then the ``Content-Type`` of the reply will be ``multipart/byteranges`` and each part will include a ``Content-Range`` header.
779

    
780
==========================  ===============================
781
Reply Header Name           Value
782
==========================  ===============================
783
ETag                        The ETag of the object
784
Content-Length              The size of the data returned
785
Content-Type                The MIME content type of the object
786
Content-Range               The range of data included (only on a single range request)
787
Last-Modified               The last object modification date (regardless of version)
788
Content-Encoding            The encoding of the object (optional)
789
Content-Disposition         The presentation style of the object (optional)
790
X-Object-Hash               The Merkle hash
791
X-Object-UUID               The object's UUID
792
X-Object-Version            The object's version identifier
793
X-Object-Version-Timestamp  The object's version timestamp
794
X-Object-Modified-By        The user that comitted the object's version
795
X-Object-Manifest           Object parts prefix in ``<container>/<object>`` form (optional)
796
X-Object-Sharing            Object permissions (optional)
797
X-Object-Shared-By          Object inheriting permissions (optional)
798
X-Object-Allowed-To         Allowed actions on object (optional)
799
X-Object-Public             Object's publicly accessible URI (optional)
800
X-Object-Meta-*             Optional user defined metadata
801
==========================  ===============================
802

    
803
Sharing headers (``X-Object-Sharing``, ``X-Object-Shared-By`` and ``X-Object-Allowed-To``) are only included if the request is for the object's latest version (no specific ``version`` parameter is set).
804

    
805
===========================  ==============================
806
Return Code                  Description
807
===========================  ==============================
808
200 (OK)                     The request succeeded
809
206 (Partial Content)        The range request succeeded
810
304 (Not Modified)           The object has not been modified
811
412 (Precondition Failed)    The condition set can not be satisfied
812
416 (Range Not Satisfiable)  The requested range is out of limits
813
===========================  ==============================
814

    
815

    
816
PUT
817
"""
818

    
819
====================  ================================
820
Request Header Name   Value
821
====================  ================================
822
If-Match              Put if ETags match with current object
823
If-None-Match         Put if ETags don't match with current object
824
ETag                  The MD5 hash of the object (optional to check written data)
825
Content-Length        The size of the data written
826
Content-Type          The MIME content type of the object
827
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
828
X-Copy-From           The source path in the form ``/<container>/<object>``
829
X-Move-From           The source path in the form ``/<container>/<object>``
830
X-Source-Account      The source account to copy/move from
831
X-Source-Version      The source version to copy from
832
Content-Encoding      The encoding of the object (optional)
833
Content-Disposition   The presentation style of the object (optional)
834
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
835
X-Object-Sharing      Object permissions (optional)
836
X-Object-Public       Object is publicly accessible (optional)
837
X-Object-Meta-*       Optional user defined metadata
838
====================  ================================
839

    
840
|
841

    
842
======================  ===================================
843
Request Parameter Name  Value
844
======================  ===================================
845
format                  Optional extended request/conflict response type (can be ``json`` or ``xml``)
846
hashmap                 Optional hashmap provided instead of data (no value parameter)
847
======================  ===================================
848

    
849
The request is the object's data (or part of it), except if a hashmap is provided (using ``hashmap`` and ``format`` parameters). If using a hashmap and all different parts are stored in the server, the object is created. Otherwise the server returns Conflict (409) with the list of the missing parts (in simple text format, with one hash per line, or in JSON/XML - depending on the ``format`` parameter).
850

    
851
Hashmaps should be formatted as outlined in ``GET``.
852

    
853
==========================  ===============================
854
Reply Header Name           Value
855
==========================  ===============================
856
ETag                        The MD5 hash of the object
857
X-Object-Version            The object's new version
858
==========================  ===============================
859

    
860
The ``X-Object-Sharing`` header may include either a ``read=...`` comma-separated user/group list, or a ``write=...`` comma-separated user/group list, or both separated by a semicolon (``;``). Groups are specified as ``<account>:<group>``. To publish the object, set ``X-Object-Public`` to ``true``. To unpublish, set to ``false``, or use an empty header value.
861

    
862
==============================  ==============================
863
Return Code                     Description
864
==============================  ==============================
865
201 (Created)                   The object has been created
866
409 (Conflict)                  The object can not be created from the provided hashmap, or there are conflicting permissions (a list of missing hashes, or a list of conflicting sharing paths will be included in the reply)
867
411 (Length Required)           Missing ``Content-Length`` or ``Content-Type`` in the request
868
413 (Request Entity Too Large)  Insufficient quota to complete the request
869
422 (Unprocessable Entity)      The MD5 checksum of the data written to the storage system does not match the (optionally) supplied ETag value
870
==============================  ==============================
871

    
872

    
873
COPY
874
""""
875

    
876
====================  ================================
877
Request Header Name   Value
878
====================  ================================
879
If-Match              Proceed if ETags match with object
880
If-None-Match         Proceed if ETags don't match with object
881
Destination           The destination path in the form ``/<container>/<object>``
882
Destination-Account   The destination account to copy to
883
Content-Type          The MIME content type of the object (optional :sup:`*`)
884
Content-Encoding      The encoding of the object (optional)
885
Content-Disposition   The presentation style of the object (optional)
886
X-Source-Version      The source version to copy from
887
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
888
X-Object-Sharing      Object permissions (optional)
889
X-Object-Public       Object is publicly accessible (optional)
890
X-Object-Meta-*       Optional user defined metadata
891
====================  ================================
892

    
893
:sup:`*` *When using django locally with the supplied web server, use the ignore_content_type parameter, or do provide a valid Content-Type, as a type of text/plain is applied by default to all requests. Client software should always state ignore_content_type, except when a Content-Type is explicitly defined by the user.*
894

    
895
======================  ===================================
896
Request Parameter Name  Value
897
======================  ===================================
898
format                  Optional conflict response type (can be ``json`` or ``xml``)
899
ignore_content_type     Ignore the supplied Content-Type
900
======================  ===================================
901

    
902
Refer to ``PUT``/``POST`` for a description of request headers. Metadata is also copied, updated with any values defined. Sharing/publishing options are not copied.
903

    
904
==========================  ===============================
905
Reply Header Name           Value
906
==========================  ===============================
907
X-Object-Version            The object's new version
908
==========================  ===============================
909

    
910
|
911

    
912
==============================  ==============================
913
Return Code                     Description
914
==============================  ==============================
915
201 (Created)                   The object has been created
916
409 (Conflict)                  There are conflicting permissions (a list of conflicting sharing paths will be included in the reply)
917
413 (Request Entity Too Large)  Insufficient quota to complete the request
918
==============================  ==============================
919

    
920

    
921
MOVE
922
""""
923

    
924
Same as ``COPY``, without the ``X-Source-Version`` request header. The ``MOVE`` operation is always applied on the latest version.
925

    
926

    
927
POST
928
""""
929

    
930
====================  ================================
931
Request Header Name   Value
932
====================  ================================
933
If-Match              Proceed if ETags match with object
934
If-None-Match         Proceed if ETags don't match with object
935
Content-Length        The size of the data written (optional, to update)
936
Content-Type          The MIME content type of the object (optional, to update)
937
Content-Range         The range of data supplied (optional, to update)
938
Transfer-Encoding     Set to ``chunked`` to specify incremental uploading (if used, ``Content-Length`` is ignored)
939
Content-Encoding      The encoding of the object (optional)
940
Content-Disposition   The presentation style of the object (optional)
941
X-Source-Object       Update with data from the object at path ``/<container>/<object>`` (optional, to update)
942
X-Source-Account      The source account to update from
943
X-Source-Version      The source version to update from (optional, to update)
944
X-Object-Bytes        The updated object's final size (optional, when updating)
945
X-Object-Manifest     Object parts prefix in ``<container>/<object>`` form (optional)
946
X-Object-Sharing      Object permissions (optional)
947
X-Object-Public       Object is publicly accessible (optional)
948
X-Object-Meta-*       Optional user defined metadata
949
====================  ================================
950

    
951
|
952

    
953
======================  ============================================
954
Request Parameter Name  Value
955
======================  ============================================
956
format                  Optional conflict response type (can be ``json`` or ``xml``)
957
update                  Do not replace metadata (no value parameter)
958
======================  ============================================
959

    
960
The ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest`` and ``X-Object-Meta-*`` headers are considered to be user defined metadata. An operation without the ``update`` parameter will overwrite all previous values and remove any keys not supplied. When using ``update`` any metadata with an empty value will be deleted.
961

    
962
To change permissions, include an ``X-Object-Sharing`` header (as defined in ``PUT``). To publish, include an ``X-Object-Public`` header, with a value of ``true``. If no such headers are defined, no changes will be applied to sharing/public. Use empty values to remove permissions/unpublish (unpublishing also works with ``false`` as a header value). Sharing options are applied to the object - not its versions.
963

    
964
To update an object's data:
965

    
966
* Either set ``Content-Type`` to ``application/octet-stream``, or provide an object with ``X-Source-Object``. If ``Content-Type`` has some other value, it will be ignored and only the metadata will be updated.
967
* If the data is supplied in the request (using ``Content-Type`` instead of ``X-Source-Object``), a valid ``Content-Length`` header is required - except if using chunked transfers (set ``Transfer-Encoding`` to ``chunked``).
968
* Set ``Content-Range`` as specified in RFC2616, with the following differences:
969

    
970
  * Client software MAY omit ``last-byte-pos`` of if the length of the range being transferred is unknown or difficult to determine.
971
  * Client software SHOULD not specify the ``instance-length`` (use a ``*``), unless there is a reason for performing a size check at the server.
972
* If ``Content-Range`` used has a ``byte-range-resp-spec = *``, data will be appended to the object.
973

    
974
Optionally, truncate the updated object to the desired length with the ``X-Object-Bytes`` header.
975

    
976
A data update will trigger an ETag change. Updated ETags may happen asynchronously and appear at the server with a delay.
977

    
978
No reply content. No reply headers if only metadata is updated.
979

    
980
==========================  ===============================
981
Reply Header Name           Value
982
==========================  ===============================
983
ETag                        The new ETag of the object (data updated)
984
X-Object-Version            The object's new version
985
==========================  ===============================
986

    
987
|
988

    
989
==============================  ==============================
990
Return Code                     Description
991
==============================  ==============================
992
202 (Accepted)                  The request has been accepted (not a data update)
993
204 (No Content)                The request succeeded (data updated)
994
409 (Conflict)                  There are conflicting permissions (a list of conflicting sharing paths will be included in the reply)
995
411 (Length Required)           Missing ``Content-Length`` in the request
996
413 (Request Entity Too Large)  Insufficient quota to complete the request
997
416 (Range Not Satisfiable)     The supplied range is invalid
998
==============================  ==============================
999

    
1000
The ``POST`` method can also be used for creating an object via a standard HTML form. If the request ``Content-Type`` is ``multipart/form-data``, none of the above headers will be processed. The form should have an ``X-Object-Data`` field, as in the following example. The token is passed as a request parameter. ::
1001

    
1002
  <form method="post" action="https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt?X-Auth-Token=0000" enctype="multipart/form-data">
1003
    <input type="file" name="X-Object-Data">
1004
    <input type="submit">
1005
  </form>
1006

    
1007
This will create/override the object with the given name, as if using ``PUT``. The ``Content-Type`` of the object will be set to the value of the corresponding header sent in the part of the request containing the data (usually, automatically handled by the browser). Metadata, sharing and other object attributes can not be set this way.
1008

    
1009
==========================  ===============================
1010
Reply Header Name           Value
1011
==========================  ===============================
1012
ETag                        The MD5 hash of the object
1013
X-Object-Version            The object's new version
1014
==========================  ===============================
1015

    
1016
|
1017

    
1018
==============================  ==============================
1019
Return Code                     Description
1020
==============================  ==============================
1021
201 (Created)                   The object has been created
1022
413 (Request Entity Too Large)  Insufficient quota to complete the request
1023
==============================  ==============================
1024

    
1025

    
1026
DELETE
1027
""""""
1028

    
1029
======================  ===================================
1030
Request Parameter Name  Value
1031
======================  ===================================
1032
until                   Optional timestamp
1033
======================  ===================================
1034

    
1035
If ``until`` is defined, the object is "purged" up to that time (the history up to then is deleted).
1036

    
1037
No reply content/headers.
1038

    
1039
===========================  ==============================
1040
Return Code                  Description
1041
===========================  ==============================
1042
204 (No Content)             The request succeeded
1043
===========================  ==============================
1044

    
1045
Sharing and Public Objects
1046
^^^^^^^^^^^^^^^^^^^^^^^^^^
1047

    
1048
Read and write control in Pithos is managed by setting appropriate permissions with the ``X-Object-Sharing`` header. The permissions are applied using prefix-based inheritance. Thus, each set of authorization directives is applied to all objects sharing the same prefix with the object where the corresponding ``X-Object-Sharing`` header is defined. For simplicity, nested/overlapping permissions are not allowed. Setting ``X-Object-Sharing`` will fail, if the object is already "covered", or another object with a longer common-prefix name already has permissions. When retrieving an object, the ``X-Object-Shared-By`` header reports where it gets its permissions from. If not present, the object is the actual source of authorization directives.
1049

    
1050
A user may ``GET`` another account or container. The result will include a limited reply, containing only the allowed containers or objects respectively. A top-level request with an authentication token, will return a list of allowed accounts, so the user can easily find out which other users share objects. The ``X-Object-Allowed-To`` header lists the actions allowed on an object, if it does not belong to the requesting user.
1051

    
1052
Objects that are marked as public, via the ``X-Object-Public`` meta, are also available at the corresponding URI returned for ``HEAD`` or ``GET``. Requests for public objects do not need to include an ``X-Auth-Token``. Pithos will ignore request parameters and only include the following headers in the reply (all ``X-Object-*`` meta is hidden):
1053

    
1054
==========================  ===============================
1055
Reply Header Name           Value
1056
==========================  ===============================
1057
ETag                        The ETag of the object
1058
Content-Length              The size of the data returned
1059
Content-Type                The MIME content type of the object
1060
Content-Range               The range of data included (only on a single range request)
1061
Last-Modified               The last object modification date (regardless of version)
1062
Content-Encoding            The encoding of the object (optional)
1063
Content-Disposition         The presentation style of the object (optional)
1064
==========================  ===============================
1065

    
1066
Public objects are not included and do not influence cross-user listings. They are, however, readable by all users.
1067

    
1068
Summary
1069
^^^^^^^
1070

    
1071
List of differences from the OOS API:
1072

    
1073
* Support for ``X-Account-Meta-*`` style headers at the account level. Use ``POST`` to update.
1074
* Support for ``X-Container-Meta-*`` style headers at the container level. Can be set when creating via ``PUT``. Use ``POST`` to update.
1075
* Header ``X-Container-Object-Meta`` at the container level and parameter ``meta`` in container listings. (**TBD**)
1076
* Account and container policies to manage behavior and limits. Container behavior overrides account settings. Account quota sets the maximum bytes limit, regardless of container values.
1077
* Headers ``X-Container-Block-*`` at the container level, exposing the underlying storage characteristics.
1078
* All metadata replies, at all levels, include latest modification information.
1079
* At all levels, a ``HEAD`` or ``GET`` request may use ``If-Modified-Since`` and ``If-Unmodified-Since`` headers.
1080
* Container/object lists include all associated metadata if the reply is of type JSON/XML. Some names are kept to their OOS API equivalents for compatibility.
1081
* Option to include only shared containers/objects in listings.
1082
* Object metadata allowed, in addition to ``X-Object-Meta-*``: ``Content-Encoding``, ``Content-Disposition``, ``X-Object-Manifest``. These are all replaced with every update operation, except if using the ``update`` parameter (in which case individual keys can also be deleted). Deleting meta by providing empty values also works when copying/moving an object.
1083
* Multi-range object ``GET`` support as outlined in RFC2616.
1084
* Object hashmap retrieval through ``GET`` and the ``format`` parameter.
1085
* Object create via hashmap through ``PUT`` and the ``format`` parameter.
1086
* The object's Merkle hash is always returned in the ``X-Object-Hash`` header.
1087
* The object's UUID is always returned in the ``X-Object-UUID`` header. The UUID remains unchanged, even when the object's data or metadata changes, or the object is moved to another path (is renamed). A new UUID is assigned when creating or copying an object.
1088
* Object create using ``POST`` to support standard HTML forms.
1089
* Partial object updates through ``POST``, using the ``Content-Length``, ``Content-Type``, ``Content-Range`` and ``Transfer-Encoding`` headers. Use another object's data to update with ``X-Source-Object`` and ``X-Source-Version``. Truncate with ``X-Object-Bytes``.
1090
* Include new version identifier in replies for object replace/change requests.
1091
* Object ``MOVE`` support and ``ignore_content_type`` parameter in both ``COPY`` and ``MOVE``.
1092
* Conditional object create/update operations, using ``If-Match`` and ``If-None-Match`` headers.
1093
* Time-variant account/container listings via the ``until`` parameter.
1094
* Object versions - parameter ``version`` in ``HEAD``/``GET`` (list versions with ``GET``), ``X-Object-Version-*`` meta in replies, ``X-Source-Version`` in ``PUT``/``COPY``.
1095
* Sharing/publishing with ``X-Object-Sharing``, ``X-Object-Public`` at the object level. Cross-user operations are allowed - controlled by sharing directives. Available actions in cross-user requests are reported with ``X-Object-Allowed-To``. Permissions may include groups defined with ``X-Account-Group-*`` at the account level. These apply to the object - not its versions.
1096
* Support for prefix-based inheritance when enforcing permissions. Parent object carrying the authorization directives is reported in ``X-Object-Shared-By``.
1097
* Copy and move between accounts with ``X-Source-Account`` and ``Destination-Account`` headers.
1098
* Large object support with ``X-Object-Manifest``.
1099
* Trace the user that created/modified an object with ``X-Object-Modified-By``.
1100
* Purge container/object history with the ``until`` parameter in ``DELETE``.
1101

    
1102
Clarifications/suggestions:
1103

    
1104
* All non-ASCII characters in headers should be URL-encoded.
1105
* Authentication is done by another system. The token is used in the same way, but it is obtained differently. The top level ``GET`` request is kept compatible with the OOS API and allows for guest/testing operations.
1106
* Some processing is done in the variable part of all ``X-*-Meta-*`` headers. If it includes underscores, they will be converted to dashes and the first letter of all intra-dash strings will be capitalized.
1107
* A ``GET`` reply for a level will include all headers of the corresponding ``HEAD`` request.
1108
* To avoid conflicts between objects and virtual directory markers in container listings, it is recommended that object names do not end with the delimiter used.
1109
* The ``Accept`` header may be used in requests instead of the ``format`` parameter to specify the desired request/reply format. The parameter overrides the header.
1110
* Container/object lists use a ``200`` return code if the reply is of type JSON/XML. The reply will include an empty JSON/XML.
1111
* In headers, dates are formatted according to RFC 1123. In extended information listings, the ``last_modified`` field is formatted according to ISO 8601 (for OOS API compatibility). All other fields (Pithos extensions) use integer tiemstamps.
1112
* The ``Last-Modified`` header value always reflects the actual latest change timestamp, regardless of time control parameters and version requests. Time precondition checks with ``If-Modified-Since`` and ``If-Unmodified-Since`` headers are applied to this value.
1113
* A copy/move using ``PUT``/``COPY``/``MOVE`` will always update metadata, keeping all old values except the ones redefined in the request headers.
1114
* A ``HEAD`` or ``GET`` for an ``X-Object-Manifest`` object, will include modified ``Content-Length`` and ``ETag`` headers, according to the characteristics of the objects under the specified prefix. The ``Etag`` will be the MD5 hash of the corresponding ETags concatenated. In extended container listings there is no metadata processing.
1115

    
1116
The Pithos Client
1117
-----------------
1118

    
1119
User Experience
1120
^^^^^^^^^^^^^^^
1121

    
1122
Hopefully this API will allow for a multitude of client implementations, each supporting a different device or operating system. All clients will be able to manipulate containers and objects - even software only designed for OOS API compatibility. But a Pithos interface should not be only about showing containers and folders. There are some extra user interface elements and functionalities that should be common to all implementations.
1123

    
1124
Upon entrance to the service, a user is presented with the following elements - which can be represented as folders or with other related icons:
1125

    
1126
* The ``home`` element, which is used as the default entry point to the user's "files". Objects under ``home`` are represented in the usual hierarchical organization of folders and files.
1127
* The ``trash`` element, which contains files that have been marked for deletion, but can still be recovered.
1128
* The ``shared`` element, which contains all objects shared by the user to other users of the system.
1129
* The ``others`` element, which contains all objects that other users share with the user.
1130
* The ``groups`` element, which contains the names of groups the user has defined. Each group consists of a user list. Group creation, deletion, and manipulation is carried out by actions originating here.
1131
* The ``history`` element, which allows browsing past instances of ``home`` and - optionally - ``trash``.
1132

    
1133
Objects in Pithos can be:
1134

    
1135
* Moved to trash and then deleted.
1136
* Shared with specific permissions.
1137
* Made public (shared with non-Pithos users).
1138
* Restored from previous versions.
1139

    
1140
Some of these functions are performed by the client software and some by the Pithos server.
1141

    
1142
In the first version of Pithos, objects could also be assigned custom tags. This is no longer supported. Existing deployments can migrate tags into a specific metadata value, i.e. ``X-Object-Meta-Tags``.
1143

    
1144
Implementation Guidelines
1145
^^^^^^^^^^^^^^^^^^^^^^^^^
1146

    
1147
Pithos clients should use the ``pithos`` and ``trash`` containers for active and inactive objects respectively. If any of these containers is not found, the client software should create it, without interrupting the user's workflow. The ``home`` element corresponds to ``pithos`` and the ``trash`` element to ``trash``. Use ``PUT`` with the ``X-Move-From`` header, or ``MOVE`` to transfer objects from one container to the other. Use ``DELETE`` to remove from ``pithos`` without trashing, or to remove from ``trash``. When moving objects, detect naming conflicts with the ``If-Match`` or ``If-None-Match`` headers. Such conflicts should be resolved by the user.
1148

    
1149
Object names should use the ``/`` delimiter to impose a hierarchy of folders and files.
1150

    
1151
The ``shared`` element should be implemented as a read-only view of the ``pithos`` container, using the ``shared`` parameter when listing objects. The ``others`` element, should start with a top-level ``GET`` to retrieve the list of accounts accessible to the user. It is suggested that the client software hides the next step of navigation - the container - if it only includes ``pithos`` and forwards the user directly to the objects.
1152

    
1153
Public objects are not included in ``shared`` and ``others`` listings. It is suggested that they are marked in a visually distinctive way in ``pithos`` listings (for example using an icon overlay).
1154

    
1155
A special application menu, or a section in application preferences, should be devoted to managing groups (the ``groups`` element). All group-related actions are implemented at the account level.
1156

    
1157
Browsing past versions of objects should be available both at the object and the container level. At the object level, a list of past versions can be included in the screen showing details or more information on the object (metadata, permissions, etc.). At the container level, it is suggested that clients use a ``history`` element, which presents to the user a read-only, time-variable view of ``pithos`` contents. This can be accomplished via the ``until`` parameter in listings. Optionally, ``history`` may include ``trash``.
1158

    
1159
Uploading and downloading data
1160
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1161

    
1162
By using hashmaps to upload and download objects the corresponding operations can complete much faster.
1163

    
1164
In the case of an upload, only the missing blocks will be submitted to the server:
1165

    
1166
* Calculate the hash value for each block of the object to be uploaded. Use the hash algorithm and block size of the destination container.
1167
* Send a hashmap ``PUT`` request for the object.
1168

    
1169
  * Server responds with status ``201`` (Created):
1170

    
1171
    * Blocks are already on the server. The object has been created. Done.
1172

    
1173
  * Server responds with status ``409`` (Conflict):
1174

    
1175
    * Server's response body contains the hashes of the blocks that do not exist on the server.
1176
    * For each hash value in the server's response (or all hashes together):
1177

    
1178
      * Send a ``POST`` request to the destination container with the corresponding data.
1179

    
1180
* Repeat hashmap ``PUT``. Fail if the server's response is not ``201``.
1181

    
1182
Consulting hashmaps when downloading allows for resuming partially transferred objects. The client should retrieve the hashmap from the server and compare it with the hashmap computed from the respective local file. Any missing parts can be downloaded with ``GET`` requests with the additional ``Range`` header.
1183

    
1184
Syncing
1185
^^^^^^^
1186

    
1187
Consider the following algorithm for synchronizing a local folder with the server. The "state" is the complete object listing, with the corresponding attributes.
1188
 
1189
::
1190

    
1191
  L: local state (stored state from last sync with the server)
1192
  C: current state (state computed right before sync)
1193
  S: server state
1194

    
1195
  if C == L:
1196
      # No local changes
1197
      if S == L:
1198
          # No remote changes, nothing to do
1199
      else:
1200
          # Update local state to match that of the server
1201
         L = S
1202
  else:
1203
      # We have local changes
1204
      if S == L:
1205
          # No remote changes, update the server
1206
          S = C
1207
          L = S
1208
      else:
1209
          # Both we and server have changes
1210
          if C == S:
1211
              # We were lucky, we did the same change
1212
              L = S
1213
          else:
1214
              # We have conflicting changes
1215
              resolve conflict
1216

    
1217
Notes:
1218

    
1219
* States represent file hashes (it is suggested to use Merkle). Deleted or non-existing files are assumed to have a magic hash (e.g. empty string).
1220
* Updating a state (either local or remote) implies downloading, uploading or deleting the appropriate file.
1221

    
1222
Recommended Practices and Examples
1223
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1224

    
1225
Assuming an authentication token is obtained, the following high-level operations are available - shown with ``curl``:
1226

    
1227
* Get account information ::
1228

    
1229
    curl -X HEAD -D - \
1230
         -H "X-Auth-Token: 0000" \
1231
         https://pithos.dev.grnet.gr/v1/user
1232

    
1233
* List available containers ::
1234

    
1235
    curl -X GET -D - \
1236
         -H "X-Auth-Token: 0000" \
1237
         https://pithos.dev.grnet.gr/v1/user
1238

    
1239
* Get container information ::
1240

    
1241
    curl -X HEAD -D - \
1242
         -H "X-Auth-Token: 0000" \
1243
         https://pithos.dev.grnet.gr/v1/user/pithos
1244

    
1245
* Add a new container ::
1246

    
1247
    curl -X PUT -D - \
1248
         -H "X-Auth-Token: 0000" \
1249
         https://pithos.dev.grnet.gr/v1/user/test
1250

    
1251
* Delete a container ::
1252

    
1253
    curl -X DELETE -D - \
1254
         -H "X-Auth-Token: 0000" \
1255
         https://pithos.dev.grnet.gr/v1/user/test
1256

    
1257
* List objects in a container ::
1258

    
1259
    curl -X GET -D - \
1260
         -H "X-Auth-Token: 0000" \
1261
         https://pithos.dev.grnet.gr/v1/user/pithos
1262

    
1263
* List objects in a container (extended reply) ::
1264

    
1265
    curl -X GET -D - \
1266
         -H "X-Auth-Token: 0000" \
1267
         https://pithos.dev.grnet.gr/v1/user/pithos?format=json
1268

    
1269
  It is recommended that extended replies are cached and subsequent requests utilize the ``If-Modified-Since`` header.
1270

    
1271
* List metadata keys used by objects in a container
1272

    
1273
  Will be in the ``X-Container-Object-Meta`` reply header, included in container information or object list (``HEAD`` or ``GET``). (**TBD**)
1274

    
1275
* List objects in a container having a specific meta defined ::
1276

    
1277
    curl -X GET -D - \
1278
         -H "X-Auth-Token: 0000" \
1279
         https://pithos.dev.grnet.gr/v1/user/pithos?meta=favorites
1280

    
1281
* Retrieve an object ::
1282

    
1283
    curl -X GET -D - \
1284
         -H "X-Auth-Token: 0000" \
1285
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
1286

    
1287
* Retrieve an object (specific ranges of data) ::
1288

    
1289
    curl -X GET -D - \
1290
         -H "X-Auth-Token: 0000" \
1291
         -H "Range: bytes=0-9" \
1292
         https://pithos.dev.grnet.gr/v1/user/pithos/README.txt
1293

    
1294
  This will return the first 10 bytes. To get the first 10, bytes 30-39 and the last 100 use ``Range: bytes=0-9,30-39,-100``.
1295

    
1296
* Add a new object (folder type) (**TBD**) ::
1297

    
1298
    curl -X PUT -D - \
1299
         -H "X-Auth-Token: 0000" \
1300
         -H "Content-Type: application/directory" \
1301
         https://pithos.dev.grnet.gr/v1/user/pithos/folder
1302

    
1303
* Add a new object ::
1304

    
1305
    curl -X PUT -D - \
1306
         -H "X-Auth-Token: 0000" \
1307
         -H "Content-Type: text/plain" \
1308
         -T EXAMPLE.txt
1309
         https://pithos.dev.grnet.gr/v1/user/pithos/folder/EXAMPLE.txt
1310

    
1311
* Update an object ::
1312

    
1313
    curl -X POST -D - \
1314
         -H "X-Auth-Token: 0000" \
1315
         -H "Content-Length: 10" \
1316
         -H "Content-Type: application/octet-stream" \
1317
         -H "Content-Range: bytes 10-19/*" \
1318
         -d "0123456789" \
1319
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1320

    
1321
  This will update bytes 10-19 with the data specified.
1322

    
1323
* Update an object (append) ::
1324

    
1325
    curl -X POST -D - \
1326
         -H "X-Auth-Token: 0000" \
1327
         -H "Content-Length: 10" \
1328
         -H "Content-Type: application/octet-stream" \
1329
         -H "Content-Range: bytes */*" \
1330
         -d "0123456789" \
1331
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1332

    
1333
* Update an object (truncate) ::
1334

    
1335
    curl -X POST -D - \
1336
         -H "X-Auth-Token: 0000" \
1337
         -H "X-Source-Object: /folder/EXAMPLE.txt" \
1338
         -H "Content-Range: bytes 0-0/*" \
1339
         -H "X-Object-Bytes: 0" \
1340
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1341

    
1342
  This will truncate the object to 0 bytes.
1343

    
1344
* Add object metadata ::
1345

    
1346
    curl -X POST -D - \
1347
         -H "X-Auth-Token: 0000" \
1348
         -H "X-Object-Meta-First: first_meta_value" \
1349
         -H "X-Object-Meta-Second: second_meta_value" \
1350
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1351

    
1352
* Delete object metadata ::
1353

    
1354
    curl -X POST -D - \
1355
         -H "X-Auth-Token: 0000" \
1356
         -H "X-Object-Meta-First: first_meta_value" \
1357
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt
1358

    
1359
  Metadata can only be "set". To delete ``X-Object-Meta-Second``, reset all metadata.
1360

    
1361
* Delete an object ::
1362

    
1363
    curl -X DELETE -D - \
1364
         -H "X-Auth-Token: 0000" \
1365
         https://pithos.dev.grnet.gr/v1/user/folder/EXAMPLE.txt