Abstract

This paper investigates how segment-based audio retrieval can support professional music curation through Talaria, a human-centered system that allows curators to query with perceptually salient excerpts (e.g., choruses, vocal lines) and refine results via acoustic and editorial filters to prioritize control, flexibility, and interpret ability over automated recommendations. Developed through a Research through Design (RtD) process and evaluated with fourteen professionals from a major Asian streaming platform, the study combined semi-structured interviews, a scenario-based questionnaire, and usability testing to examine three research questions: (i) how segment-based retrieval supports creativity and control; (ii) where it is most effective or inadequate; and (iii) which design factors shape trust in AI-assisted tools. Findings show that Talaria enhances early-stage exploration and mid-stage stylistic refinement, excelling in perceptual tasks such as playlist flow optimization and context-specific selection, while exhibiting limits in concept-driven work that depends on symbolic reasoning or lyrical interpretation (e.g., artist branding, narrative playlists). Trust hinged on perceived consistency, transparency, and editorial autonomy, and was strengthened by lightweight, interaction-level explanations that clarify similarity logic without constraining judgment. We distill design implications for human-centered curation systems: treat interaction-level interpret ability as a first-class requirement, preserve user agency through non-prescriptive controls, adopt genre-aware segmentation and preview strategies, and support fluid, iterative workflows. The work offers design evidence that segment-level retrieval can augment professional curation when interpret ability and agency are built into the interaction.

Keywords

Segment-based retrieval; Human-centered AI; Music curation; Interpretability

Creative Commons License

Creative Commons Attribution-NonCommercial 4.0 International License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Conference Track

Track 4 - Human-Centered AI

Share

COinS
 
Dec 2nd, 9:00 AM Dec 5th, 5:00 PM

Talaria: Designing Segment-Based Audio Retrieval to Support Human-Centered Music Curation

This paper investigates how segment-based audio retrieval can support professional music curation through Talaria, a human-centered system that allows curators to query with perceptually salient excerpts (e.g., choruses, vocal lines) and refine results via acoustic and editorial filters to prioritize control, flexibility, and interpret ability over automated recommendations. Developed through a Research through Design (RtD) process and evaluated with fourteen professionals from a major Asian streaming platform, the study combined semi-structured interviews, a scenario-based questionnaire, and usability testing to examine three research questions: (i) how segment-based retrieval supports creativity and control; (ii) where it is most effective or inadequate; and (iii) which design factors shape trust in AI-assisted tools. Findings show that Talaria enhances early-stage exploration and mid-stage stylistic refinement, excelling in perceptual tasks such as playlist flow optimization and context-specific selection, while exhibiting limits in concept-driven work that depends on symbolic reasoning or lyrical interpretation (e.g., artist branding, narrative playlists). Trust hinged on perceived consistency, transparency, and editorial autonomy, and was strengthened by lightweight, interaction-level explanations that clarify similarity logic without constraining judgment. We distill design implications for human-centered curation systems: treat interaction-level interpret ability as a first-class requirement, preserve user agency through non-prescriptive controls, adopt genre-aware segmentation and preview strategies, and support fluid, iterative workflows. The work offers design evidence that segment-level retrieval can augment professional curation when interpret ability and agency are built into the interaction.

 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.