Halley Young - PhD Researcher

Halley Young wears many hats. What are you interested in?

Introduction

As a PhD researcher, my work focuses on the intersection of generative AI and the arts. I aim to pioneer individualized generative distributions in artistic creation through a combination of theoretical and technological advancements.

Work with Google DeepMind

I have worked extensively with Google DeepMind on issues of controllability in large language models for music. Specifically, I have worked on making the original Music Transformer more steerable for use by composers with clear goals in mind but who also want to be surprised and inspired by the creativity of the tool. Some of our work was published in IUI's HAI-GEN workshop..

Neurosymbolic Analysis of Images, Folk Music, and Poetry

My dissertation involves a neurosymbolic analysis of images, folk music, and poetry, identifying gaps in existing theoretical models. This led to the development of a neurosymbolic model that combines empirical data with domain-specific theory. This model introduces a new data structure called a “prototype graph," redefining musical and poetic form as relationships between prototype nodes and their associated spans. This work began with a 2019 ICML paper and was continued in a 2022 NEURIPS paper.

Electronic Music Theory

My research in electronic music examines the computational semantics and Turing completeness of Digital Audio Workstations, with a specific focus on Ableton. I have dissected the computational capabilities of DAWs, exploring their potential as complete computational systems and their role in shaping our approach to music creation and analysis.

In addition, I have proposed a novel method for analyzing DAW files, treating them as structured programs requiring a blend of program analysis, digital signal processing, and traditional harmonic and rhythmic analysis. This approach equates DAW files in electronic music to traditional scores in classical compositions.

For more detail on this work, see the following preprints:

On the semantics of Digital Audio Workstations as Program Interpreters
Automated Listening is More Meaningful with the Source Code (coauthored with Matthew Chung, University of California San Diego)
Undecidability Results in Modern Music Making

Identity-Driven Distribution Discovery in Poetry and Music

To put all my ideas together, I have used my knowledge of both traditional and deep-learning-based theories of poetry and various types of music to translate poetic and musical "identity statements" into AI-interpretable language. This has led to the creation of AI-generated poems and music that demonstrate a more nuanced and diverse range of artistic outputs, effectively creating a unique distribution of poetry or vocal music for each identity statement. For a comprehensive display of this identity-driven paradigm, go to my project webpage.

Future Work

Going forward, my research aims to establish a new paradigm in the generation of creative output, where individuals can use their unique identities to create art, experiences and productive tools that reflect their individual visions. The integration of theory, technology, and creativity will contribute significantly to the advancement of generative AI and open new avenues for personalized expression.

Publications

On the Semantics of Digital Audio Workstations as Program Interpreters
Preprint.
Halley Young.
Automated Listening is More Meaningful with the Source Code
Preprint.
Halley Young, Matthew Chung.
Undecidability Results in Modern Music Making
Preprint.
Halley Young.
Neurosymbolic Deep Generative Models for Sequence Data with Relational Constraints
NEURIPS 2022.
Halley Young, Maxwell Du and Osbert Bastani.
Compositional Steering of Music Transformers
HAI-GEN Workshop at IUI 2022.
Halley Young, Vincent Dumoulin, Pablo Castro, Jesse Engel, and Anna Huang.
A First-Order Theory of Film Scores for Generation from Lightweight Specifications
Programming Languages for Interactive Entertainment workshop at The Artificial Intelligence for Interactive Digital Entertainment Conference.
Halley Young.
A Foundations-of-Computation Approach to Formalizing Musical Analysis
Proceedings of the 2020-2021 International Computer Music Conference.
Halley Young.
Algorithm as Determinant of Form in Music
Proceedings of the 2019 International Conference on Computational Creativity.
Halley Young.
Learning Neurosymbolic Generative Models via Program Synthesis
Proceedings of the 2019 International Conference on Machine Learning (ICML).
Halley Young, Osbert Bastani, and Mayur Naik.
A categorial grammar for music and its use in automatic melody generation
Proceedings of the 5th ACM SIGPLAN International Workshop on Functional Art, Music, Modeling, and Design, 2017.
Halley Young.
A computational approach to the syntax and semantics of music
Bachelor's thesis supervised by Bud Mishra, 2017.
Halley Young.

Experience in Every Aspect of the Large Language Model Lifecycle

Building My Own LLMs

I have built LLMs from scratch for music generation, including developing a novel positional encoding system to account for the specifics of the domain in how sequences are represented. Some example outputs can be found here.

Work with Google DeepMind - Developing Models for Finetuning Existing LLMs

I have worked extensively with Google DeepMind on developing novel methods to steer expert-guided features of non-text-based transformers using variants of prompt and bias tuning. My work is published in the the HAI-GEN proceedings.

Exploring the Power of Existing Large Language Models

I have developed intensive pipelines, algorithms, and methods to convert ChatGPT output to creative artistry and software across many domains:

Identity-Driven Poetry

I have created a standalone website for exploring identity through creative computing. My software translates any identity statement or manifesto by a user into a distribution of diverse and high-quality art, including poetry.

Audio Songs with Music and Vocals

Electronic Music Created Using FoxDot Language

Renoise Software

Interactive Media Designed Using ChatGPT

Neurosymbolic Approaches: A Versatile Tool for Limited Data Contexts

The field of neurosymbolic computing represents a groundbreaking fusion of neural networks and symbolic reasoning, offering powerful solutions even without large amounts of data. This approach is especially useful in scenarios where data scarcity poses a challenge, providing a robust framework for understanding and generating complex structures.

Applications Across Diverse Domains

Vision: In the realm of computer vision, my work presented at ICML 2019 showcases the use of neurosymbolic models to effectively process and interpret visual data, even when large datasets are not available. This approach leverages the strengths of both neural networks and symbolic systems to create more efficient and interpretable models.
Music and NLP: Extending this methodology, my research presented at NEURIPS 2022 demonstrates the application of neurosymbolic models in music and natural language processing (NLP). By incorporating relational constraints and high-level structure, these models significantly enhance the generative process, capturing the intricacies of musical sequences and textual data with impressive coherence and depth.

The versatility of neurosymbolic models lies in their ability to integrate empirical data with structured, logical frameworks. This integration not only advances the field of AI but also opens up new avenues for creative expression and analysis in various domains.

Impact and Future Directions

My ongoing research in neurosymbolic computing aims to further refine these models, enhancing their applicability and effectiveness across different fields. The goal is to establish neurosymbolic models as a standard tool in scenarios where data limitations are a significant challenge, thereby broadening the horizons of AI research and its practical applications.

Contact

For inquiries or collaboration opportunities, please reach out at halleyy@seas.upenn.edu.

Neurosymbolic Theories of Form

My thesis proposes a groundbreaking neurosymbolic model for musical form, a pivotal innovation in understanding and generating harmonious compositions. This model bridges the gap between music theory and computer science, offering new insights into musical form through the lens of advanced data structures and algorithms.

Prototype Graph: I introduced a novel data structure, termed the prototype graph, which redefines musical form as interrelated prototype nodes. This structure is key to understanding and manipulating musical compositions both generatively and classificatorily.
Empirical Validation: By generating novel folk tunes using this model and comparing them to traditional and AI-generated music, I have empirically demonstrated the superiority of the prototype graph approach. This method has proven effective in capturing the complexity of musical forms beyond the scope of traditional "antecedent-consequent" models and other models of melodic structure.
Published Research: This innovative approach was recognized and published in NEURIPS, a leading venue in machine learning, underscoring its significance in the field.
Generative Applications: The prototype graph model has enabled the creation of musical pieces with unique harmonic attributes yet coherent internal structures, pushing the boundaries of what is achievable with machine learning in music generation.

Innovations in Music Theory: Empirical Testing and Generative Applications

My dissertation epitomizes my commitment to transforming academic music theory into practical, generative applications. This ambitious endeavor bridges the gap between theoretical concepts and their empirical validation and application.

ICMC 2021 Paper - "A Theory-of-Computation Approach to Formalizing Musical Analysis" A pioneering study using formal logic to define diverse musical genres, including world music and contemporary artists. This paper represents a crucial step in quantifying and systematizing music theory for generative purposes.
PLIE Paper - "A First-Order Theory of Film Scores for Generation from Lightweight Specifications": This work extends the theoretical frameworks to film scores, integrating film-score theory with empirical studies on music and emotion. It was an innovative attempt to generate music that aligns with cinematic narratives and emotional cues. More importantly, it represented the first serious attempt to use SMT solvers, which translate first order logic definitions (such as those from my ICMC paper) into concrete musical examples.

This facet of my research underscores a novel approach in music theory - using logical and mathematical models as a foundation for music generation, enabling a deeper understanding and exploration of music's emotional and theoretical dimensions.

Collaboration with Google DeepMind: Bridging Technology and Composition

My collaboration underscored my commitment to making generative music theory accessible and practical for composers.

Generative Music Theory: The project was centered around the concept of defining and manipulating key musical functions critical in MIDI composition. This approach empowers composers to harness technology for creative expression.
Functional Control in MIDI: I identified and listed essential functions in MIDI composition, developing a system that allows composers to control these elements precisely, thereby enriching the compositional process.
Technical and Artistic Achievement: This endeavor required not only a deep understanding of music theory and computer science but also the practical application of these theories. Writing substantial amounts of music to calibrate these functions exemplifies the synergy between theoretical knowledge and real-world application.

The outcome of this collaboration highlights the potential of technology to enhance musical creativity, providing composers with new tools to shape their musical ideas in the realm of automatic music generation.

Exploring the Power of Digital Audio Workstations in Music Innovation

My research delves deep into the capabilities and complexities of Digital Audio Workstations, revealing their immense potential in the realm of music theory and production.

Turing Completeness and Undecidability: Demonstrating that DAWs are not just powerful computational tools but also exhibit undecidability, this research brings to light the theoretical and practical implications of these platforms in music-making.
Mathematical and Musical Analysis: I advocate for a comprehensive approach to analyzing DAW files, combining program analysis, digital signal processing, and traditional harmonic and rhythmic analysis to unlock deeper insights into music composition and theory.
Empirical Research on DAWs: Having accumulated a vast corpus of over 2400 DAW programs, my ongoing work involves extracting and interpreting data from these files, uncovering patterns in the use of vocal samples, randomness, melodic complexity, and sound design.

This avenue of research not only highlights the importance of DAWs in modern music theory and production but also opens new pathways for understanding and appreciating genres like hip-hop and beatmaking, traditionally overlooked in academic studies.

Enhancing Sound Quality in Computer-Generated Music, Including Vocal Music

My research has made significant strides in improving the sound quality and interpretability of computer-generated music, focusing on the innovative use of DAWs for music generation.

Advancements in DAW Projects: Breaking the limitations of current text-to-music generators, my approach enables the creation of complex musical pieces with high-quality audio and vocal tracks, extending well beyond the typical 30-second limit.
High-Quality Samples and Sound-Synthesis: By employing top-tier samples and advanced sound-synthesis techniques, I have significantly reduced audio artifacts, resulting in clearer and more engaging musical compositions.
Integration with ChatGPT: Utilizing ChatGPT, I've developed a method to iteratively refine the high-level properties of DAW files and lyrics, which has been instrumental in completing intricate and nuanced musical pieces.

This work not only propels the field of music composition forward but also underscores the untapped potential of DAWs in enhancing the creative capabilities of composers across various genres, particularly in popular music with vocals.

Dissertation Work Analyzing Folk Music

In my dissertation, I embarked on a pioneering journey to reevaluate and redefine traditional Western models of melody structure in folk music. Integrating advanced computational models with a deep understanding of music theory and computer science, I developed a novel approach that challenges both conventional Western music theory models and purely theoretical models in musicology. This work aims to demonstrate the limitations of existing models and advocate for a more nuanced, computationally informed approach to understanding and generating folk music melodies.

The heart of my research is the development of a neurosymbolic model for musical form, introducing the innovative 'prototype graph' data structure.

This prototype graph represents musical form as a network of relationships between 'prototype nodes' (specific musical elements) and the music they represent, as well as the music-theoretic relationships between musical spans and their respective nodes. Understanding these relationships, both generative and classificational, is key to grasping the model's capacity to create and categorize musical structures.

My hypothesis challenged two prevailing models: the traditional "antecedent-consequent" or "period/sentence" models found in Western music theory, and neural models devoid of music-theoretic grounding. I posited that these models were inadequate in capturing the diversity and complexity of forms found in folk music.

Methodology: Employ the prototype graph model to generate novel folk tunes.
Evaluation: Compare these against tunes generated by traditional and neural models.
Results: The neurosymbolic model consistently outperformed other approaches, suggesting a more robust approach to understanding musical form within folksong entails combining data-driven insights with interpretable music theories.

A significant aspect of my research involved applying novel harmonic constraints to the generated music, diverging from typical characteristics seen in folk music. This included variations inspired by composers like Debussy and Messiaen, challenging the model's ability to maintain formal coherence while exploring the essence of a folk song's appeal.

Formalizing Comparative Musicology and Empiricizing Music Theory

In addition to my dissertation work, I have delved into the fascinating world of comparative musicology, using formal logic to uncover previously unobserved parallels and distinct divergences in global musical traditions. This exploration spans a wide array of musical styles, from the serialist melodic contours of Robert Morris to the rhythmic intricacies in the performances of Armenian artist Tigran Hamasyan.

Using first-order logic as a primary tool, my research systematically compares different musical traditions, offering new perspectives on the universal and unique aspects of global music.

The cornerstone of this research is the application of first-order logic to dissect and understand complex musical concepts, allowing for a systematic comparison of different musical traditions and facilitating a nuanced understanding of their underlying structures. For instance, I contrast formal definitions of Western common practice music with aspects of music from other cultures, such as Hindustani gamaka.

Objective: Illustrate both the shared foundations and unique characteristics of diverse musical cultures.
Methodology: Apply SMT solvers to process first-order logic specifications.
Theories Studied: Topic theory, theories of musical emotion, traditional Western harmony, and approaches to world music.

In my publication of this work at the Programming Languages and Interactive Entertainment (PLIE) workshop, I demonstrated the usefulness of SMT solvers, an advanced technique in applied logic, in automatically converting formal theories into actual music. These SMT solvers take in first-order logic specifications and output specific “world models” - in this case, pieces of music - that satisfy those specifications. This approach enables a precise analysis of how specific musical features evoke certain cultural contexts or themes.

Unveiling the Complexity and Nuances in Electronic Music through Digital Audio Workstation Analysis

Digital Audio Workstations like Ableton or Logic represent a fusion of advanced computing and creative music production, offering an unprecedented scope for musical innovation. I began this research by establishing that DAWs are Turing complete, highlighting their immense computational power and versatility. Next, I demonstrated the undecidability of common queires about DAW files in various music-making modes, discovering fundamental boundaries to our ability to understand their capabilities.

I explore the intricate layers of electronic compositions through a comprehensive method combining program analysis, digital signal processing, and traditional harmonic and rhythmic analyses.

The proposed method for analyzing DAW files treats them as complex programs. The combination of program analysis, digital signal processing, and traditional harmonic and rhythmic analyses is pivotal in understanding the creative and technical aspects of DAW-based music.

Focus: Examine elements such as the use of vocal samples, incorporation of randomness, and the balance between melodic complexity and sound design.
Implications: Insights into evolving practices in electronic music production and the relationship between production techniques and what the audience ultimately hears.
Cultural Significance: New understanding of the artistic merit of genres of music less commonly studied in the academy.

I built a database of over 2400 DAW programs, then applied program analysis and digital signal processing to extract musical insights from the database. I then began developing a music-theoretic language for discussing DAW-produced music. My hope is that having a language equal in sophistication to Western score notation will inspire more work understanding these popular but understudied genres.

If you haven't already, head to my project website!

Introduction to Artistic Identity Exploration

This research venture embarks on an ambitious goal: enabling artists across various mediums to harness technology and theory for creating unique expressions true to their personal visions. It's a journey through the realms of digital art, music, narrative, and poetry.

Artists are empowered to articulate their 'aesthetic identities' through personalized manifestos. These statements serve as a blueprint for AI to generate works that are authentic reflections of the artist's individuality. The ultimate objective is a comprehensive system that fosters a new era of identity-based artistic generation, offering a platform for artists to produce deeply personal and technically sound art.

Identity-Based Computing in Artistic AI

A pivotal discovery in my research, 'identity-based computing' represents a breakthrough in the fusion of AI and art. This concept involves utilizing 'aesthetic identity statements' to fine-tune AI models like ChatGPT, enabling the creation of art that not only showcases technical excellence but deeply resonates with the artist's personal vision and identity.

By defining their aesthetic identities, artists guide AI in producing works that are true reflections of their unique perspectives. This approach marries the precision and capabilities of AI with the nuanced and subjective realm of artistic creation, opening up new avenues for personalized art generation.

Harnessing AI for Personalized Creativity

Enabling artists to express their unique aesthetic visions through AI
Blending cutting-edge AI technology with individual artistic expression
Expanding the scope of AI in generating diverse and personal art forms

This innovative approach is not just a technological advancement but also a step towards understanding and appreciating the role of personal identity in the realm of art and creativity.

Collaboration with Poet Charles Bernstein: AI and Identity in Art

My collaboration with Charles Bernstein, a distinguished poet known for his deep exploration of language and form, marks a pivotal moment in my research. This partnership has birthed a groundbreaking approach where poetic 'identity statements' are translated into AI-interpretable language, leading to the creation of AI-generated poetry.

This innovative method is centered on empowering artists to express their aesthetic identities, allowing AI to produce works that genuinely reflect these individual visions. Bernstein's involvement in this project has been especially insightful. Through his experimentation with different personas and styles, such as embracing Yeats' poetic style or developing unique manifestos, he has effectively showcased the profound influence of identity on creative output.

This journey not only highlights the capabilities of AI in artistic creation but also illuminates the significant role of personal identity in the realm of creative expression.

Technical Contributions: Identity Statements and Generative Models

A key technical achievement of our research is the transformation of each identity statement into a unique generative model distribution. This innovative approach allows us to derive an unlimited array of artistic pieces from a single identity statement, effectively creating a boundless spectrum of artistic expression tailored to individual identities.

Crafting the language of these identity statements requires extensive domain expertise. The challenge lies in translating a wide array of diverse and complex identity statements into art that authentically aligns with each identity. This process necessitates a deep understanding of both the artistic domain and the intricacies of AI technology.

Empirical Evidence of Alignment with Identity Statements

Development of a system that interprets and translates identity statements into generative art
Empirical evidence demonstrating close alignment of generated poetry with the tested identity statements
Continued exploration and refinement in other artistic domains beyond poetry

Our empirical findings, particularly in the poetry domain, provide substantial evidence that the generated output closely aligns with the identity statements. This not only validates our approach but also paves the way for further exploration and refinement in other artistic domains.

Future Collaborations and Expanding Horizons

Looking ahead, my ambition is to forge partnerships with art therapists and music therapists, and to deepen engagement with professional artists across a wide range of disciplines. This research is not just about pushing the boundaries of artistic creation; it also explores the therapeutic potential of AI-enhanced art for personal healing and expression.

The versatility of our developed system allows it to cater to a wide array of artistic and therapeutic needs. From facilitating the philosophical explorations of renowned poets to providing a medium for young individuals facing personal challenges, this technology adapts to the unique requirements of its users. The system’s adaptive quality highlights its capacity to tailor creative expression, meeting the diverse and deeply personal needs of a broad spectrum of individuals.

Expanding the Impact of AI in Art and Therapy

Integrating AI with therapy to provide solace and a voice for personal expression
Developing technology that adapts to the philosophical and emotional needs of artists and individuals
Striving to refine AI technology for broader societal, therapeutic, and aesthetic impact

My ongoing research is dedicated to refining this technology, broadening its influence and utility, and unlocking new dimensions of the human experience through the integration of AI and identity exploration in both artistic and therapeutic contexts.