This guide provides comprehensive documentation for the Catalyzed platform's dataset API suite, including datasets, schema versions, and data versions management.

Overview

The dataset API suite consists of three interconnected services:

For authentication, request formatting, pagination, rate limits, and other common API patterns, please refer to our comprehensive REST API documentation.

Base URL: https://platform-api.us.catalyzed.ai

Core Concepts

Dataset Lifecycle

  1. Create Dataset - Establish dataset metadata and permissions
  2. Define Schema - Create schema version with column definitions and transformations
  3. Upload Data - Create data version and upload file content
  4. Process Dataset - Trigger ML/AI processing pipeline
  5. Query & Download - Access processed data and results

Version Management

Both schema and data use a versioning system: