feat(sample-meta): L1 filename-metadata harvester (free per-file signal)
RIFF chunk dump proved samples carry NO semantic embedded metadata (only encoder tags) — the Pulsar browser shows FILENAMES. So harvest the filename: leading pad index + instrument-token lexicon → fleet family + source hint. Conservative: opaque names (JUPI, doing_it_right, 808hc's 'HC') stay family=None → fall back to audio. Detects kit-like folders (≥2 families by name), the 'jazz is a kit' case. Corpus coverage: 49% folders / 31% files named, 36 kit-like folders.
Showing
armada/tide-table/sample_meta.py
0 → 100644
Please
register
or
sign in
to comment