Download
A curated sample of 1,000 recent COLA records (2025 approvals) with associated label images and extracted barcodes.

Download cola-sample-pack-v1.zip (~500 KB). No account or API key required.

Contents
The ZIP contains three CSV files:

| File | Rows | Description |
|---|---|---|
| cola.csv | 1,000 | Product label approval records: brand, product type, origin, OCR-extracted ABV/volume, LLM-enriched category, tasting notes, and more |
| cola_image.csv | ~1,750 | Label images for the 1,000 COLAs: dimensions, container position (front/back/neck/strip), and OCR-extracted text |
| cola_image_barcode.csv | ~500 | Barcodes extracted from label images: type (UPC-A, EAN-13, QR, etc.), decoded value, pixel position |
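
As a rough illustration of working with the archive, the three CSV files can be read straight from the ZIP without unpacking it. This is a minimal sketch using only the Python standard library. The helper name `load_sample_pack` is hypothetical, and the UTF-8 encoding and the internal folder layout of the archive are assumptions, so the code matches on file names rather than full paths.

```python
import csv
import io
import zipfile
from pathlib import PurePosixPath


def load_sample_pack(zip_source):
    """Return {csv_file_name: list of row dicts} for every CSV in the archive.

    zip_source may be a path to the downloaded ZIP or a binary file-like object.
    """
    tables = {}
    with zipfile.ZipFile(zip_source) as archive:
        for member in archive.namelist():
            if not member.lower().endswith(".csv"):
                continue  # skip anything that is not a CSV file
            with archive.open(member) as raw:
                # Assumption: the files are UTF-8; utf-8-sig also tolerates a BOM.
                text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
                # Key by base name so a nested folder inside the ZIP does not matter.
                tables[PurePosixPath(member).name] = list(csv.DictReader(text))
    return tables
```

Calling `load_sample_pack("cola-sample-pack-v1.zip")` on the downloaded file should give a dictionary keyed by `cola.csv`, `cola_image.csv`, and `cola_image_barcode.csv`, where each value is a list of rows represented as dictionaries keyed by column name.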
Relationships
Key columns
cola.csv includes 60+ columns. Highlights:

- TTB_ID: unique identifier for each COLA approval
- BRAND_NAME, PRODUCT_NAME: the product
- PRODUCT_TYPE: Wine, Malt Beverage, or Distilled Spirits
- LLM_CATEGORY, LLM_CATEGORY_PATH: AI-classified product taxonomy
- LLM_PRODUCT_DESCRIPTION: natural language product description from label reading
- OCR_ABV: alcohol by volume, extracted via OCR
- BARCODE_VALUE, BARCODE_TYPE: primary barcode from the label
- MAIN_TTB_IMAGE_ID: viewable at https://dyuie4zgfxmt6.cloudfront.net/{TTB_IMAGE_ID}.webp
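
To make the image URL pattern concrete, the sketch below fills the template with the MAIN_TTB_IMAGE_ID value of a row from cola.csv. The function name `main_image_url` is hypothetical, and the handling of blank values is an assumption: the source does not say whether every record has a main image.

```python
# URL template as given in the column list above.
IMAGE_URL_TEMPLATE = "https://dyuie4zgfxmt6.cloudfront.net/{TTB_IMAGE_ID}.webp"


def main_image_url(row):
    """Build the label image URL for one cola.csv row (a dict keyed by column name).

    Returns None when MAIN_TTB_IMAGE_ID is missing or blank (assumed possible).
    """
    image_id = (row.get("MAIN_TTB_IMAGE_ID") or "").strip()
    if not image_id:
        return None
    return IMAGE_URL_TEMPLATE.format(TTB_IMAGE_ID=image_id)
```

For a row read with `csv.DictReader`, `main_image_url(row)` returns the URL string, which can then be opened in a browser or fetched with any HTTP client.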
Full dataset
The sample represents a small slice of the full COLA Cloud dataset:

- 2.9M+ COLA records (back to 2005)
- 5M+ label images
- 575K+ extracted barcodes
- Updated daily (~2,500 new approvals per week)

