STAC ICD for PRIP can be downloaded from this link.


The table below collects the comments on this version of the document.


Id. | Author | Section/Page number | Title | Comment | Answer
1 | m-mohr | p. 7 | EOPF_SPECIFICATION | Missing link, there's a <tbd> instead. |
2 | CloudFerro | p. 21 | REQ-0050 | Timestamp precision should be defined to avoid differences between PRIP implementations. | Proposal to keep the same precision as for CDSE (second).
3 | CloudFerro | p. 25 | REQ-0085 | Thumbnail size and resolution must be defined to ensure consistency between product types. If no thumbnail is generated by PS, this asset should be empty or removed. Whatever is included in the products and does not fit the requirements should be listed as asset type "preview" or similar. | See higher level document: STAC Requirements and Recommendations for EOF Services.
4 | CloudFerro | p. 25 | REQ-0090 | There are various media types within ZARR, so setting the MIME type of all objects to "application/vnd-zarr" may cause unexpected behaviour in standard client libraries. |
5 | CloudFerro | p. 28 | REQ-0207 | PRIP does not need to enable searching by every common attribute; a limited subset will suffice. The PRIP API is not for end-users and is used only for systematic downloads. | ESA asked for this level of searching capabilities.
6 | CloudFerro | p. 29 | REQ-0305, REQ-0410 | Multipart uploads to object storage may affect the object checksum, so in some cases it may be unreliable (TBC): with multipart upload, single-part upload md5 != multipart-download md5. | It's up to the client's implementation and should be addressed by the "tailored ICD".
7 | CloudFerro | p. 31 | REQ-0391 | The lack of relation between accounts in the central identity and S3 credentials makes access management prone to errors and delays in provisioning. S3 credentials should be self-managed and reflect access roles (similar to CF S3 keys manager). | No known standard to include S3 credentials into IAM → out of scope of this ICD. It must be described by the infrastructure provider.
8 | CloudFerro | p. 31 | REQ-0400 | Regarding: "PRIP client point of view, retrieving an asset needs to: List all S3 objects sharing the prefix provided by the asset link, leveraging the ListObjectsV2 S3 operation". Prefix listing is time-consuming and might be expensive for multiple requests (it will raise the maintenance cost of the PRIP) and will affect timeliness. A compressed product is recommended. | Question already discussed with ESA and submitted to its arbitration. In which way can it raise the maintenance cost?
9 | CloudFerro | | STAC-PRIP-API-REQ-0210 | Access control to the STAC Catalogue is implemented at the Collection level, while CDSE manages it at the product_type level. How will collections and product types be joined? | One collection corresponds to a unique product_type. Besides, PRIP is not intended for end-users, thus access control is less critical.
10 | CloudFerro | | STAC-PRIP-API-REQ-0340 | Quota management is expressed as n terabytes per month: what if a reprocessing campaign is published? Should the quota be adjusted every time? | Quota adjustment should be managed at operations level and arbitrated by ESA. Not to be described in the ICD.
11 | CloudFerro | | Lack of a requirement regarding the maximum allowable number of items returned from a STAC query | We propose to add an additional requirement making the maximum number of returned items configurable. |
12 | CloudFerro | p. 22 | publication date - Timestamp of the availability of the current Item in the PRIP instance Catalogue. update date - If STAC Item properties have been updated, timestamp of the last update. | Regarding the change of the modification date, it is not clear whether the LTA/CDSE should re-download the file. |
13 | CloudFerro | | | File extension should be if the zipped/concatenated product is to be exposed. The file:size and file:checksum https://github.com/stac-extensions/file | Zarr item is provided in extended form (folders & sub-folders) → neither file extension nor checksum are relevant here.
14 | CloudFerro | Section 3.4.3.4 | Downloading ZARR content | More info needed regarding "Note: It is assumed that the AWS CLI is configured with the expected S3 API endpoint and authentication credentials." It is not clear what the authentication will look like. A sample AWS CLI config file would be appreciated. Are short-term or long-term credentials to be used (https://docs.aws.amazon.com/cli/v1/userguide/cli-configure-files.html)? Are AWS security tokens (https://docs.unified-streaming.com/documentation/vod/cloud/amazon/aws_s3_authentication.html#aws-security-tokens) to be used? | Should depend on the provider → not a candidate for the ICD; rather provided in the tailored ICD.
15 | CloudFerro | | Bulk download via a single zipped file without compression. | Downloading multiple small Zarr chunk files will generate a huge overhead in the number of network requests and md5/checksum checks. Maybe we could adapt the chunk size for storage usage, in order to reduce the number of objects. |
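
As an illustration of comment 2 (REQ-0050), the sketch below shows one way a PRIP implementation could normalise timestamps to second precision before publishing them. The function name `to_second_precision`, the UTC "Z" output format and the choice of truncation rather than rounding are assumptions made for this example; the ICD does not specify any of them.

```python
from datetime import datetime, timedelta, timezone


def to_second_precision(ts: datetime) -> str:
    """Return ts as a UTC ISO 8601 string truncated to whole seconds.

    Assumption: sub-second digits are dropped (truncated), not rounded.
    """
    if ts.tzinfo is None:
        # Naive timestamps are ambiguous, so they are rejected here.
        raise ValueError("timestamp must be timezone-aware")
    utc = ts.astimezone(timezone.utc).replace(microsecond=0)
    return utc.strftime("%Y-%m-%dT%H:%M:%SZ")


# Two representations of the same instant give the same published value.
a = datetime(2024, 5, 17, 10, 30, 15, 987654, tzinfo=timezone.utc)
b = datetime(2024, 5, 17, 12, 30, 15, 123, tzinfo=timezone(timedelta(hours=2)))
print(to_second_precision(a))
print(to_second_precision(b))
```

Applying a single normalisation of this kind in every PRIP instance would avoid clients seeing different precisions for the same property.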

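As an illustration of comment 6 (REQ-0305, REQ-0410), the sketch below compares the plain MD5 of a payload with the ETag-style value commonly observed on S3 and S3-compatible stores for multipart uploads (MD5 of the concatenated per-part digests, suffixed with the part count). This behaviour is provider-dependent and is an assumption here, as are the 5 MiB part size and the function names.

```python
import hashlib


def md5_hex(data: bytes) -> str:
    """Plain MD5 of the whole payload, as a single-part upload would report."""
    return hashlib.md5(data).hexdigest()


def multipart_etag(data: bytes, part_size: int) -> str:
    """ETag-style value often seen after a multipart upload.

    Assumption: the store hashes each part, then hashes the concatenated
    binary digests and appends "-<number of parts>".
    """
    parts = [data[i:i + part_size] for i in range(0, len(data), part_size)]
    digests = b"".join(hashlib.md5(part).digest() for part in parts)
    return f"{hashlib.md5(digests).hexdigest()}-{len(parts)}"


payload = b"x" * (12 * 1024 * 1024)           # 12 MiB of dummy data
single = md5_hex(payload)
multi = multipart_etag(payload, 5 * 1024 * 1024)  # 5 MiB parts -> 3 parts
print(single)
print(multi)
print(single == multi.split("-")[0])
```

Under these assumptions the two values differ, so a client that compares a downloaded object's MD5 with a multipart ETag would report a false mismatch; a checksum published in the Item metadata avoids depending on the upload method.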

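As an illustration of comment 8 (REQ-0400), the sketch below lists every object under an asset prefix through the ListObjectsV2 paginator and gives a rough count of the requests needed to retrieve the asset. The S3 client is passed in (for example one created with boto3); the function names and the cost model (one request per listing page of up to 1000 keys, plus one GET per object) are assumptions for this example, not part of the ICD.

```python
from typing import Any, Iterator, Tuple


def iter_asset_keys(s3_client: Any, bucket: str, prefix: str) -> Iterator[Tuple[str, int]]:
    """Yield (key, size) for every object sharing the asset prefix.

    s3_client is expected to behave like a boto3 S3 client.
    """
    paginator = s3_client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # "Contents" is absent when a page holds no objects.
        for obj in page.get("Contents", []):
            yield obj["Key"], obj["Size"]


def estimate_requests(object_count: int, page_size: int = 1000) -> int:
    """Rough request count: listing pages plus one GET per object."""
    pages = max(1, -(-object_count // page_size))  # ceiling division
    return pages + object_count


# A Zarr product with 2500 chunk objects versus a single archived product.
print(estimate_requests(2500))
print(estimate_requests(1))
```

The estimate grows linearly with the number of chunk objects, which is the overhead the comment contrasts with a single compressed product.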