📦
Caret
Terminal tool for inspecting and cleaning large LLM training datasets. Handles JSONL, Parquet, and CSV with memory-mapped I/O, and includes near-duplicate detection, token visualization, dataset linting, and an MCP server.
0 installs
Trust: 39 — Low
Devtools
Documentation
No README available
