Successful customer orders in the webshop.
All orders since 2020-01-01.
Orders with their line items are in their current state (no history included).
Data can be used for reports, analytics, and machine learning use cases. Orders may be linked and joined with other tables.
Not suitable for real-time use cases. Data may not be used to identify individual customers. Max data processing per day: 10 TiB.
| Name | Type | Attributes |
|---|---|---|
| production | s3 | One folder per model. One file per day. • environment: prod • roles: [{'role': 'analyst_us', 'description': 'Access to the data for US region'}, {'role': 'analyst_cn', 'description': 'Access to the data for China region'}] • delimiter: new_line • format: json • location: s3://datacontract-example-orders-latest/v2/{model}/*.json |
| development | s3 | One folder per model. One file per day. • environment: dev • roles: [{'role': 'analyst_us', 'description': 'Access to the data for US region'}, {'role': 'analyst_cn', 'description': 'Access to the data for China region'}] • delimiter: new_line • format: json • location: s3://datacontract-example-orders-latest/v2/{model}/*.json |

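
A minimal sketch of how a consumer might interpret the server attributes above: the `{model}` placeholder in the location is filled with a model name, and each file is read as newline-delimited JSON. The helper names `location_for` and `parse_ndjson` and the sample records are assumptions for illustration; actual S3 access is left out.

```python
import json

# Location template as listed in the servers table; {model} is a placeholder.
LOCATION_TEMPLATE = "s3://datacontract-example-orders-latest/v2/{model}/*.json"


def location_for(model: str) -> str:
    """Fill the {model} placeholder for one model (hypothetical helper)."""
    return LOCATION_TEMPLATE.format(model=model)


def parse_ndjson(text: str) -> list:
    """Parse newline-delimited JSON: one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


# Made-up sample content standing in for one daily file.
sample = (
    '{"order_id": "a", "order_total": 1999}\n'
    '{"order_id": "b", "order_total": 500}\n'
)
records = parse_ndjson(sample)
orders_location = location_for("orders")
```

Reading the objects themselves would need an S3 client and credentials for one of the listed roles, which this sketch does not cover.
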
One record per order. Includes cancelled and deleted orders.
| Field | Type | Attributes |
|---|---|---|
| order_id | string | An internal ID that identifies an order in the online shop. • businessName: Order ID • tags: ['orders'] • customProperties: [{'property': 'pii', 'value': 'True'}] • primaryKey • logicalTypeOptions: {'format': 'uuid'} • required • unique • classification: restricted |
| order_timestamp | timestamp | The business timestamp in UTC when the order was successfully registered in the source system and the payment was successful. • tags: ['business-timestamp'] • required |
| order_total | integer | Total amount in the smallest monetary unit (e.g., cents). • required |
| customer_id | string | Unique identifier for the customer. • logicalTypeOptions: {'minLength': 10, 'maxLength': 20} |
| customer_email_address | string | The email address, as entered by the customer. • customProperties: [{'property': 'pii', 'value': 'True'}] • logicalTypeOptions: {'format': 'email'} • required • classification: sensitive • transformSourceObjects: ['com.example.service.checkout.checkout_db.orders.email_address'] • quality: [{'description': 'The email address is not verified and may be invalid.', 'type': 'text'}] |
| processed_timestamp | timestamp | The timestamp when the record was processed by the data platform. • customProperties: [{'property': 'jsonType', 'value': 'string'}, {'property': 'jsonFormat', 'value': 'date-time'}] • required |

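
The field constraints above could be checked on a single record roughly as follows. This is a sketch, not the contract's own tooling: the function names, the simplified email pattern, and the assumption that both timestamps arrive as ISO 8601 strings are illustrative choices. Note that `uuid.UUID` is lenient and also accepts UUIDs written without hyphens.

```python
import re
import uuid
from datetime import datetime

# Simplified email shape check (assumption; not a full RFC 5322 validator).
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")

# Fields marked "required" in the table above.
REQUIRED = ("order_id", "order_timestamp", "order_total",
            "customer_email_address", "processed_timestamp")


def is_uuid(value) -> bool:
    try:
        uuid.UUID(value)
        return True
    except (ValueError, TypeError, AttributeError):
        return False


def is_timestamp(value) -> bool:
    # Assumes ISO 8601 strings; "Z" is rewritten for older Python versions.
    if not isinstance(value, str):
        return False
    try:
        datetime.fromisoformat(value.replace("Z", "+00:00"))
        return True
    except ValueError:
        return False


def validate_order(record: dict) -> list:
    """Return a list of human-readable problems; empty means the record passed."""
    errors = [f"missing required field: {name}"
              for name in REQUIRED if record.get(name) is None]
    order_id = record.get("order_id")
    if order_id is not None and not is_uuid(order_id):
        errors.append("order_id is not a UUID")
    total = record.get("order_total")
    if total is not None and (isinstance(total, bool) or not isinstance(total, int)):
        errors.append("order_total must be an integer")
    customer_id = record.get("customer_id")
    if customer_id is not None and not (
            isinstance(customer_id, str) and 10 <= len(customer_id) <= 20):
        errors.append("customer_id must be 10 to 20 characters")
    email = record.get("customer_email_address")
    if email is not None and not (isinstance(email, str) and EMAIL_RE.match(email)):
        errors.append("customer_email_address does not look like an email")
    for name in ("order_timestamp", "processed_timestamp"):
        value = record.get(name)
        if value is not None and not is_timestamp(value):
            errors.append(f"{name} is not an ISO 8601 timestamp")
    return errors


# Made-up records for illustration.
good = {
    "order_id": "1e4d3c2b-5a6f-4b7c-8d9e-0a1b2c3d4e5f",
    "order_timestamp": "2024-05-01T12:00:00Z",
    "order_total": 1999,  # 19.99 expressed in cents
    "customer_id": "1000000001",
    "customer_email_address": "jane@example.com",
    "processed_timestamp": "2024-05-01T13:00:00Z",
}
bad = dict(good, order_total="19.99", customer_id="42")
```

Here `validate_order(good)` returns an empty list, while `validate_order(bad)` reports the non-integer total and the too-short customer ID.
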
A single article that is part of an order.
| Field | Type | Attributes |
|---|---|---|
| line_item_id | string | Primary key of the lines_item_id table • primaryKey • primaryKeyPosition: 2 • required |
| order_id | string | An internal ID that identifies an order in the online shop. • businessName: Order ID • tags: ['orders'] • customProperties: [{'property': 'pii', 'value': 'True'}] • primaryKey • primaryKeyPosition: 1 • logicalTypeOptions: {'format': 'uuid'} • classification: restricted • relationships: [{'type': 'foreignKey', 'to': 'orders.order_id'}] |
| sku | string | The purchased article number • businessName: Stock Keeping Unit • tags: ['inventory'] • logicalTypeOptions: {'pattern': '^[A-Za-z0-9]{8,14}$'} |
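
As an illustration of the line item constraints above, the sketch below checks the composite primary key (order_id at position 1, line_item_id at position 2), the foreign key to `orders.order_id`, and the `sku` pattern. The function names and the sample data are assumptions; a real check would run against the stored files rather than in-memory lists.

```python
import re

# Pattern copied from the sku field's logicalTypeOptions.
SKU_RE = re.compile(r"^[A-Za-z0-9]{8,14}$")


def composite_key(item: dict) -> tuple:
    """Key order follows primaryKeyPosition: order_id first, then line_item_id."""
    return (item["order_id"], item["line_item_id"])


def check_line_items(items: list, known_order_ids: set) -> list:
    """Return a list of problems found in the given line items."""
    problems = []
    seen = set()
    for item in items:
        key = composite_key(item)
        if key in seen:
            problems.append(f"duplicate key: {key}")
        seen.add(key)
        # Foreign key: every line item must point at an existing order.
        if item["order_id"] not in known_order_ids:
            problems.append(f"unknown order_id: {item['order_id']}")
        sku = item.get("sku")  # sku is not marked required
        if sku is not None and not SKU_RE.fullmatch(sku):
            problems.append(f"invalid sku: {sku}")
    return problems


# Made-up sample data for illustration.
items = [
    {"order_id": "o-1", "line_item_id": "li-1", "sku": "ABC12345"},
    {"order_id": "o-1", "line_item_id": "li-1", "sku": "ABC12345"},  # duplicate key
    {"order_id": "o-9", "line_item_id": "li-2", "sku": "bad sku"},   # orphan, bad sku
]
problems = check_line_items(items, known_order_ids={"o-1"})
```

`fullmatch` is used instead of `match` so that a trailing newline after an otherwise valid SKU is rejected.
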

| Property | Value | Unit |
|---|---|---|
| generalAvailability | The server is available during support hours | |
| retention | P1Y | |
| freshness | 25 | h |
| latency | 25 | h |
| frequency | daily | |
| support | 9am to 5pm in EST on business days | |
| backup | weekly | |
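
One way to read the freshness value is as the maximum allowed age of the newest record. The sketch below applies that reading with the 25 h limit; which timestamp to measure (here the newest `order_timestamp`) is an assumption, as are the function name and the sample times.

```python
from datetime import datetime, timedelta, timezone

# freshness: 25 h, taken from the service-level table.
FRESHNESS_LIMIT = timedelta(hours=25)


def is_fresh(newest_record_time: datetime, now: datetime) -> bool:
    """True if the newest record is no older than the freshness limit."""
    return now - newest_record_time <= FRESHNESS_LIMIT


# Made-up timestamps for illustration, all in UTC.
now = datetime(2024, 5, 2, 12, 0, tzinfo=timezone.utc)
within_limit = is_fresh(datetime(2024, 5, 1, 12, 0, tzinfo=timezone.utc), now)  # 24 h old
too_old = is_fresh(datetime(2024, 5, 1, 10, 0, tzinfo=timezone.utc), now)       # 26 h old
```

With a daily update frequency, a 25 h limit leaves about one hour of slack for a delayed run.
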