Ground sample families by LISTENING, not by name. sample_classify.py runs laion/clap-htsat-unfused (transformers, torch CPU) over Dirt-Samples one-shots, scoring each against text prompts for the 12 fleet families; aggregates per folder (dominant + homogeneity → kits show as mixed). ffmpeg audio I/O, no librosa. validate/run/one commands; validate measures top-1 vs the name-confident folders. Finding (validate): 58% top-1 agreement with the name-heuristic at fine 12-way. KEY: the name 'ground truth' itself is wrong in many disagreements — CLAP correctly calls 808hc/808mc congas (perc), which the name-classifier mislabeled bass via '808'. CLAP is near-perfect on vox/break/clear-bass/kick/keys; the genuinely fuzzy zone is the melodic cluster (synth/lead/keys/pad). Prompt-tuning is whack-a-mole on noisy truth. Conclusion: trust CLAP coarsely, not at fine 12-way silently.
| Name |
Last commit
|
Last update |
|---|---|---|
| .. | ||
| punkachien | Loading commit data... | |
| sources | Loading commit data... | |
| tests | Loading commit data... | |
| .gitignore | Loading commit data... | |
| README.md | Loading commit data... | |
| ardour_stem_export.py | Loading commit data... | |
| audio_lens.py | Loading commit data... | |
| backlog_setlists.json | Loading commit data... | |
| backlog_setlists.py | Loading commit data... | |
| boundaries_take89_validated.json | Loading commit data... | |
| boundary_bleed.py | Loading commit data... | |
| boundary_bleed_take89.json | Loading commit data... | |
| build_catalog.py | Loading commit data... | |
| build_catalog_view.py | Loading commit data... | |
| build_corpus.py | Loading commit data... | |
| build_track_recording_map.py | Loading commit data... | |
| build_triage_ui.py | Loading commit data... | |
| catalog.authored.yaml | Loading commit data... | |
| catalog.generated.json | Loading commit data... | |
| catalog.yaml | Loading commit data... | |
| catalog_view.json | Loading commit data... | |
| corpus-viz-plan.md | Loading commit data... | |
| corpus.html | Loading commit data... | |
| eda_report.json | Loading commit data... | |
| edl_render.py | Loading commit data... | |
| gap-report.md | Loading commit data... | |
| locate-matrix.md | Loading commit data... | |
| master_align.json | Loading commit data... | |
| master_edl_take89.json | Loading commit data... | |
| models.py | Loading commit data... | |
| pattern_ngrams.py | Loading commit data... | |
| pattern_registry.json | Loading commit data... | |
| performance_notes.md | Loading commit data... | |
| release_candidates.json | Loading commit data... | |
| release_priority.md | Loading commit data... | |
| resplit_montreuil.py | Loading commit data... | |
| sample_classify.py | Loading commit data... | |
| sample_tfidf.json | Loading commit data... | |
| seed_edl_take89.py | Loading commit data... | |
| take-compare.html | Loading commit data... | |
| take_gig_map.md | Loading commit data... | |
| tidal_score.py | Loading commit data... | |
| tide.py | Loading commit data... | |
| tide_eda.py | Loading commit data... | |
| tokens.json | Loading commit data... | |
| track_recording_map.json | Loading commit data... | |
| triage.csv | Loading commit data... | |
| triage.html | Loading commit data... | |
| triangle.html | Loading commit data... |