#ifndef AWS_S3_CHECKSUMS_H
#define AWS_S3_CHECKSUMS_H
/**
* Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
* SPDX-License-Identifier: Apache-2.0.
*/
#include "aws/s3/s3_client.h"
/* TODO: consider moving the aws_checksum_stream to aws-c-checksum, and the rest about checksum headers and trailer to
* aws-c-sdkutil. */
struct aws_s3_checksum;
/* Checksum algorithms in priority order; earlier entries are preferred when choosing which algorithm to use. */
static const enum aws_s3_checksum_algorithm s_checksum_algo_priority_list[] = {
    AWS_SCA_CRC64NVME,
    AWS_SCA_CRC32C,
    AWS_SCA_CRC32,
    AWS_SCA_SHA1,
    AWS_SCA_SHA256,
};
AWS_STATIC_ASSERT(AWS_ARRAY_SIZE(s_checksum_algo_priority_list) == (AWS_SCA_END - AWS_SCA_INIT + 1));
struct aws_checksum_vtable {
    void (*destroy)(struct aws_s3_checksum *checksum);
    int (*update)(struct aws_s3_checksum *checksum, const struct aws_byte_cursor *buf);
    int (*finalize)(struct aws_s3_checksum *checksum, struct aws_byte_buf *out);
};
struct aws_s3_checksum {
    struct aws_allocator *allocator;
    struct aws_checksum_vtable *vtable;
    size_t digest_size;
    enum aws_s3_checksum_algorithm algorithm;
    bool good;
    union {
        struct aws_hash *hash;
        uint32_t crc_val_32bit;
        uint64_t crc_val_64bit;
    } impl;
};
struct checksum_config_storage {
    struct aws_allocator *allocator;
    struct aws_byte_buf full_object_checksum;
    bool has_full_object_checksum;
    aws_s3_meta_request_full_object_checksum_fn *full_object_checksum_callback;
    void *user_data;
    enum aws_s3_checksum_location location;
    enum aws_s3_checksum_algorithm checksum_algorithm;
    bool validate_response_checksum;
    struct {
        bool crc64nvme;
        bool crc32c;
        bool crc32;
        bool sha1;
        bool sha256;
    } response_checksum_algorithms;
};
/**
* A stream that wraps an existing stream, computes a running checksum as the data is read, and writes the checksum
* to checksum_output when the stream is destroyed.
* Note: seeking this stream fails immediately, as it would prevent an accurate calculation of the checksum.
*
* @param allocator
* @param existing_stream The real content to read from. It is kept alive by the checksum stream, and destroying the
* checksum stream destroys the existing stream.
* @param algorithm Checksum algorithm to use.
* @param checksum_output Checksum of the `existing_stream`, owned by the caller, which is calculated when this stream
* is destroyed.
*/
AWS_S3_API
struct aws_input_stream *aws_checksum_stream_new(
    struct aws_allocator *allocator,
    struct aws_input_stream *existing_stream,
    enum aws_s3_checksum_algorithm algorithm,
    struct aws_byte_buf *checksum_output);
/**
* TODO: properly support chunked encoding.
*
* A stream that wraps an existing stream and encodes it to aws_chunked. It computes a running checksum as the data is
* read and adds the checksum as a trailer at the end of the stream. All of the added bytes count toward the length of
* the stream.
* Note: seeking this stream fails immediately, as it would prevent an accurate calculation of the checksum.
*
* @param allocator
* @param existing_stream The data to be chunked. It is prepended with information on the stream length and followed
* by a final chunk and a trailing chunk containing a checksum of the existing stream.
* Destroying the chunk stream destroys the existing stream.
* @param algorithm Checksum algorithm to use.
* @param checksum_output Optional. If provided, the buffer is initialized to the appropriate size and filled with the
* checksum result once it is calculated. The caller is responsible for cleaning it up.
*/
AWS_S3_API
struct aws_input_stream *aws_chunk_stream_new(
    struct aws_allocator *allocator,
    struct aws_input_stream *existing_stream,
    enum aws_s3_checksum_algorithm algorithm,
    struct aws_byte_buf *checksum_output);
/**
* Get the size of the checksum output corresponding to the aws_s3_checksum_algorithm enum value.
*/
AWS_S3_API
size_t aws_get_digest_size_from_checksum_algorithm(enum aws_s3_checksum_algorithm algorithm);
/**
* Get header name to use for algorithm (e.g. "x-amz-checksum-crc32")
*/
AWS_S3_API
struct aws_byte_cursor aws_get_http_header_name_from_checksum_algorithm(enum aws_s3_checksum_algorithm algorithm);
/**
* Get algorithm's name (e.g. "CRC32"), to be used as the value of headers like `x-amz-checksum-algorithm`
*/
AWS_S3_API
struct aws_byte_cursor aws_get_checksum_algorithm_name(enum aws_s3_checksum_algorithm algorithm);
/**
* Get the name of the checksum algorithm as it appears in the details of an uploaded part. See
* https://docs.aws.amazon.com/AmazonS3/latest/API/API_CompletedPart.html#AmazonS3-Type-CompletedPart
*/
AWS_S3_API
struct aws_byte_cursor aws_get_completed_part_name_from_checksum_algorithm(enum aws_s3_checksum_algorithm algorithm);
/**
* Create a new aws_s3_checksum corresponding to the aws_s3_checksum_algorithm enum value.
*/
AWS_S3_API
struct aws_s3_checksum *aws_checksum_new(struct aws_allocator *allocator, enum aws_s3_checksum_algorithm algorithm);
/**
* Compute the checksum of `input` using the algorithm corresponding to the provided enum value.
* Passing a function pointer around instead of using a conditional would be faster, but the improvement would be
* negligible compared to the cost of processing the data twice, which is the only case in which this function is
* used, and the code would be harder to follow.
*/
AWS_S3_API
int aws_checksum_compute(
    struct aws_allocator *allocator,
    enum aws_s3_checksum_algorithm algorithm,
    const struct aws_byte_cursor *input,
    struct aws_byte_buf *output);
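/*
* Usage sketch (illustrative only, not part of the API): a one-shot checksum with aws_checksum_compute().
* Assumptions: `allocator` is a valid aws_allocator, the aws_byte_buf/aws_byte_cursor helpers from aws-c-common are
* available, and the function follows the usual AWS_OP_SUCCESS return convention. The output buffer is sized with
* aws_get_digest_size_from_checksum_algorithm() before the call.
*
*     struct aws_byte_cursor input = aws_byte_cursor_from_c_str("hello world");
*     struct aws_byte_buf digest;
*     aws_byte_buf_init(&digest, allocator, aws_get_digest_size_from_checksum_algorithm(AWS_SCA_SHA256));
*     if (aws_checksum_compute(allocator, AWS_SCA_SHA256, &input, &digest) == AWS_OP_SUCCESS) {
*         // digest.buffer holds digest.len bytes of checksum output
*     }
*     aws_byte_buf_clean_up(&digest);
*/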
/**
* Cleans up and deallocates checksum.
*/
AWS_S3_API
void aws_checksum_destroy(struct aws_s3_checksum *checksum);
/**
* Updates the running checksum with to_checksum. This can be called multiple times.
*/
AWS_S3_API
int aws_checksum_update(struct aws_s3_checksum *checksum, const struct aws_byte_cursor *to_checksum);
/**
* Completes the checksum computation and writes the final digest to output.
* Allocation of output is the caller's responsibility.
*/
AWS_S3_API
int aws_checksum_finalize(struct aws_s3_checksum *checksum, struct aws_byte_buf *output);
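/*
* Usage sketch (illustrative only, not part of the API): computing a checksum incrementally with aws_checksum_new(),
* aws_checksum_update(), aws_checksum_finalize(), and aws_checksum_destroy().
* Assumptions: `allocator` is a valid aws_allocator, aws_checksum_new() returns NULL on failure, and the
* aws_byte_buf/aws_byte_cursor helpers from aws-c-common are available. Return values of update/finalize are not
* checked here for brevity.
*
*     struct aws_s3_checksum *checksum = aws_checksum_new(allocator, AWS_SCA_CRC32);
*     if (checksum != NULL) {
*         struct aws_byte_buf digest;
*         aws_byte_buf_init(&digest, allocator, aws_get_digest_size_from_checksum_algorithm(AWS_SCA_CRC32));
*
*         struct aws_byte_cursor part1 = aws_byte_cursor_from_c_str("hello ");
*         struct aws_byte_cursor part2 = aws_byte_cursor_from_c_str("world");
*         aws_checksum_update(checksum, &part1);
*         aws_checksum_update(checksum, &part2);
*
*         aws_checksum_finalize(checksum, &digest);
*         aws_checksum_destroy(checksum);
*         aws_byte_buf_clean_up(&digest);
*     }
*/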
AWS_S3_API
int aws_checksum_config_storage_init(
    struct aws_allocator *allocator,
    struct checksum_config_storage *internal_config,
    const struct aws_s3_checksum_config *config,
    const struct aws_http_message *message,
    const void *log_id);
AWS_S3_API
void aws_checksum_config_storage_cleanup(struct checksum_config_storage *internal_config);
#endif /* AWS_S3_CHECKSUMS_H */