RIFF chunk dump proved the samples carry no semantic embedded metadata (only encoder tags); the Pulsar browser shows filenames. So harvest the filename: leading pad index + instrument-token lexicon → fleet family + source hint. Conservative: opaque names (JUPI, doing_it_right, the 'HC' in 808hc) stay family=None and fall back to audio. Also detects kit-like folders (≥2 families by name), i.e. the 'jazz is a kit' case. Corpus coverage: 49% of folders and 31% of files named; 36 kit-like folders.
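A minimal sketch of the filename harvest described in this commit message, not the actual contents of `sample_meta.py`. The lexicon entries, the pad-index pattern (one or two leading digits followed by a separator), and the names `LEXICON`, `parse_name` and `is_kit_like` are all assumptions made for illustration. The "source hint" is not modeled because the message does not define it; the sketch only returns the matched lexicon token.

```python
import re
from pathlib import PurePath

# Hypothetical token -> family lexicon; the real lexicon is not shown here.
LEXICON = {
    "kick": "kick", "bd": "kick",
    "snare": "snare", "sd": "snare",
    "hat": "hat", "hh": "hat",
    "clap": "clap",
    "bass": "bass",
}

# Assumed convention: a pad index is 1-2 leading digits followed by a
# separator (or end of name), so "01_kick" has pad 1 but "808hc" has none.
PAD_RE = re.compile(r"^(\d{1,2})(?=[\s_.\-]|$)[\s_.\-]*")


def parse_name(filename):
    """Return pad index, family and matched token inferred from a filename."""
    stem = PurePath(filename).stem.lower()
    pad = None
    match = PAD_RE.match(stem)
    if match:
        pad = int(match.group(1))
        stem = stem[match.end():]
    # Split on anything that is not a letter; digits act as separators.
    tokens = [t for t in re.split(r"[^a-z]+", stem) if t]
    hits = [t for t in tokens if t in LEXICON]
    families = {LEXICON[t] for t in hits}
    # Conservative: zero or conflicting families both yield family=None,
    # which signals the caller to fall back to audio analysis.
    if len(families) != 1:
        return {"pad": pad, "family": None, "token": None}
    return {"pad": pad, "family": families.pop(), "token": hits[0]}


def is_kit_like(filenames):
    """A folder is kit-like when its names resolve to >= 2 distinct families."""
    families = {parse_name(f)["family"] for f in filenames}
    families.discard(None)
    return len(families) >= 2
```

With this sketch, `parse_name("01_kick.wav")` resolves to the `kick` family with pad index 1, while `JUPI.wav`, `doing_it_right.wav` and `808hc.wav` all stay at `family=None`. A folder holding both a kick and a snare file is reported as kit-like; a folder of only kicks is not.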
| Name | Last commit | Last update |
|---|---|---|
| punkachien | | |
| sources | | |
| tests | | |
| .gitignore | | |
| README.md | | |
| ardour_stem_export.py | | |
| audio_lens.py | | |
| backlog_setlists.json | | |
| backlog_setlists.py | | |
| boundaries_take89_validated.json | | |
| boundary_bleed.py | | |
| boundary_bleed_take89.json | | |
| build_catalog.py | | |
| build_catalog_view.py | | |
| build_corpus.py | | |
| build_track_recording_map.py | | |
| build_triage_ui.py | | |
| catalog.authored.yaml | | |
| catalog.generated.json | | |
| catalog.yaml | | |
| catalog_view.json | | |
| corpus-viz-plan.md | | |
| corpus.html | | |
| eda_report.json | | |
| edl_render.py | | |
| gap-report.md | | |
| locate-matrix.md | | |
| master_align.json | | |
| master_edl_take89.json | | |
| models.py | | |
| pattern_ngrams.py | | |
| pattern_registry.json | | |
| performance_notes.md | | |
| release_candidates.json | | |
| release_priority.md | | |
| resplit_montreuil.py | | |
| sample_classify.py | | |
| sample_meta.py | | |
| sample_ontology.py | | |
| sample_panns.py | | |
| sample_tfidf.json | | |
| seed_edl_take89.py | | |
| take-compare.html | | |
| take_gig_map.md | | |
| tidal_score.py | | |
| tide.py | | |
| tide_eda.py | | |
| tokens.json | | |
| track_recording_map.json | | |
| triage.csv | | |
| triage.html | | |
| triangle.html | | |