This is the github for Hyperparam, where we share open-source contributions to the AI and Data Engineering communities. AI needs lots of data, so we're building tools for working with massive text datasets in the browser.
👀 Hyperparam CLI — Scalable dataset viewer for machine learning datasets.
🦜 Hyparquet — Parquet file parser for loading datasets in the browser.
🐤 Hyparquet Writer — Parquet file writer in JavaScript.
🐦 Hyparquet Compressors — Decompress every parquet compression format.
🐧 Hysnappy — Snappy compression optimized with WebAssembly for faster parquet parsing.
🏛️ HighTable — Windowed table component for viewing arbitrarily large datasets.
⛄ Icebird — Apache Iceberg table reader in JavaScript.
🐿️ Squirreling — Async SQL engine for querying large datasets in the browser.
🦙 HyLlama — Parse metadata from llama.cpp gguf files in JavaScript.