writings at variogr.am
On this section
Research statement
From 2001 until 2005 I taught computers how to hear music at the MIT Media Lab, working in the Music, Mind and Machine (formerly "Machine Listening")
group with Barry Vercoe. Previously I was at the NEC Research Institute in Princeton working on Minnowmatch, a machine listening
engine. Even earlier, I was working in the Natural Language Processing group at Columbia University.
I want to help people find music, and I want artists and labels to find people. Our work concentrates on both the content and
culture of music through signal processing, data mining, language analysis and machine learning techniques. We can claim higher
accuracy in music retrieval tasks such as recommendation and similarity over the currently-popular marketing influenced
collaborative filtering approaches by looking at both the signal and the community response to the signal. I also work on
learning a a query-by-description front end for music retrieval by correlating music with free text (community metadata.)
We can predict a community's reaction to a new piece of music well enough to create personalized automatic record reviews.
This can level the field for independent under-marketed musicians by having a bias-free musical intelligence perform our
filtering. Instead of "Rock and Pop" we can create meaningful similarity clusters of music that are tuned to individual tastes
and styles.
My other research interests include generative perceptual synthesis, text retrieval and understanding, audio and visual
scene analysis, time-aware kernel approaches to machine learning, and embedded sound hardware.
Short texts, interviews and summaries
Academic papers and talks
- Whitman, Brian. Learning the meaning of music. Dissertation: MIT Department of Architecture
(Program in Media Arts and Sciences.) Web
link.
- Dobson, Kelly, Brian Whitman and Daniel P.W. Ellis.
"Learning Auditory Models of Machine Voices." To appear in the 2005 Workshop on Applications of Signal Processing to Audio
and Acoustics. [ paper PDF ]
- Whitman, Brian. Learning the meaning of music. Dissertation defense,
MIT, April 14 2005. [ 7MB talk PDF ]
- Whitman, Brian, Daniel P.W. Ellis. "Automatic
Record Reviews." In Proceedings of ISMIR 2004 - 5th International Conference on Music Information
Retrieval. October 10-14, 2004, Barcelona, Spain. [ paper PDF | talk PDF ]
-
Berenzweig, Adam,
Beth Logan,
Daniel Ellis, Brian Whitman.
"A Large Scale Evaluation of Acoustic and Subjective Music Similarity Measures."
Computer Music Journal,
Summer 2004, 28(2), pp 63-76. [ web ]
- Whitman, Brian. Location One Gallery, NYC, April 28 2004: "Music to Computers." [ talk PDF ]
- Whitman, Brian. ADVENT Seminar @ Columbia University, March 5 2004: "Learning the Meaning of Music." [ talk PDF ]
- Whitman, Brian. Dorkbot-NYC, March 3 2004: "Overfitting." [ talk PDF ]
- Berenzweig, Adam,
Daniel Ellis,
Beth
Logan, Brian Whitman.
"A Large Scale Evaluation of Acoustic and Subjective Music Similarity
Measures." In Proceedings of the 2003 International Symposium on Music Information Retrieval. 26-30 October 2003, Baltimore, MD.
[ paper PDF ]
- Whitman, Brian. "Semantic Rank Reduction of Music Audio." In Proceedings of the 2003 Workshop on Applications of
Signal Processing to Audio and Acoustics (WASPAA). 19-22 October 2003, New Paltz, NY. pp135-138 [ paper PDF
| talk PDF ]
- Recht, Ben and Brian Whitman.
"Musically Expressive Sound Textures from Generalized Audio." In Proceedings of the 2003 Digital Audio Effects (DAFX03)
Conference. 8-11 September 2003, Queen Mary, University of London, U.K. [ paper PDF ]
- Whitman, Brian. Tutorial: "Cultural and Acoustic Approaches to Music Retrieval and Understanding." 7 September 2003,
Digital Audio Effects Conference, Queen Mary, Univeristy of London, U.K. (Contact for slide notes.)
- Whitman, Brian, Deb Roy, and Barry Vercoe.
"Learning Word Meanings
and Descriptive Parameter Spaces from Music."
in Proceedings of the HLT-NAACL03 workshop on Learning Word Meaning
from Non-Linguistic Data. 26-31 May 2003, Edmonton, Alberta, Canada. [ paper PDF |
talk PDF ]
- Whitman, Brian and Ryan Rifkin. "Musical Query-by-Description as a
Multiclass Learning Problem." In Proceedings of the IEEE Multimedia Signal Processing Conference. 8-11 December
2002, St. Thomas, USA. [ paper PDF | talk
PDF ]
- Ellis, Daniel, Brian Whitman, Adam Berenzweig and Steve Lawrence. "The Quest For Ground Truth in
Musical Artist Similarity." In Proceedings of the 3rd International Conference on Music Information Retrieval.
13-17 October 2002, Paris, France. [ paper PDF | talk PDF ]
- Kim, Youngmoo and Brian Whitman. "Singer Identification in
Popular Music Recordings Using Voice Coding
Features." In Proceedings of the 3rd International Conference on Music Information Retrieval. 13-17 October 2002, Paris,
France. [ paper
PDF ]
- Whitman, Brian and Paris Smaragdis. "Combining Musical and
Cultural Features for Intelligent Style Detection." In Proceedings of the 3rd International Conference on Music
Information Retrieval. 13-17 October 2002, Paris, France. [ paper PDF | talk PDF ]
- Whitman, Brian and Steve Lawrence (2002). "Inferring Descriptions
and Similarity for Music from Community Metadata." In "Voices of Nature," Proceedings of the 2002 International Computer Music Conference. pp 591-598.
16-21 September 2002, Göteborg, Sweden. [ paper PDF | talk PDF ]
- Whitman, Brian, Gary Flake, and Steve Lawrence (2001, September
10-12). Artist Detection in Music with
Minnowmatch. In Proceedings of the 2001 IEEE Workshop on Neural Networks for Signal Processing, pp. 559-568. Falmouth,
Massachusetts. [ paper link | talk PDF ]