pytajc
Pytajc is an open-source Python library and command-line tool designed to simplify data transformation and validation tasks across common data formats. It provides both a programmatic API and a CLI for parsing inputs, applying transformations, validating data against schemas, and exporting results to formats such as JSON, CSV, YAML, and XML. The project emphasizes small dependencies, clear error reporting, and straightforward integration into ETL workflows.
- Cross-format conversion between JSON, CSV, YAML, and XML
- Schema-based validation with optional JSON Schema compatibility
- Streaming and batch processing support
- Composable transformation pipelines and custom operators
- Plugin system for domain-specific operations
- Command-line tools for common ETL tasks
- Extensible adapters for input/output formats
- Portable, well-documented, and MIT-licensed
Pytajc uses a modular design with a core engine that routes data between format handlers, a set
The project originated as a small open-source effort to reduce boilerplate in ETL scripting. It has
Data transformation, ETL, JSON, CSV, YAML, XML, Python (programming language).