teaching Archives - Page 2 of 19

January 18, 2015April 28, 2016

ep-341 syllabus – Spring 2015

Programming interactive audio software and plugins in Max/MSP

Spring 2015

teacher: Tom Zicarelli – http://tomzicarelli.com

Office hours: Tuesday 1-2 PM, or Tuesday 4-5PM, at the EPD office #401 at 161 Mass Ave. Please email or call ahead.

Assignments and class notes will be posted to this blog: https://reactivemusic.net before or after the class. Search for: ep-341 to find the notes

Examples, software, links, and references demonstrated in class are available for you to use. If there is something missing from the notes, please ask about it. This is your textbook.

Syllabus:

Prototyping is the focus. Max is a seed that has grown into music, art, discoveries, products, and entire businesses.

After you take the course, you will have developed several projects. You might design a musical instrument or a plugin. You will have opportunities to solve problems. But mostly you will have a sense of how to explore possibilities by building prototypes in Max. You will have the basic skills to quickly make software to connect things, and answer questions like, “Is it possible to make something that does x?”.

You will become familiar with how other artists use Max to make things. You will be exposed to to a world of possibilities – which you may embrace or reject.

We will explore a range of methods and have opportunities to use them in projects. We’ll look at examples by artists – asking the question: How does this work?

Success depends on execution as well as good ideas.

Topics: (subject to change)

Max
Reverse engineering
Transforming and scaling data
Designing user interfaces
Messages and communication, MIDI/OSC
randomness and probability
Connecting hardware and other devices
Working with sensors, data, and API’s
Audio signal processing and synthesis.
Problem solving, prototyping, portfolios.
plugins, Max for Live.
Basic video processing and visualization
Alternative tools: Pd
Max externals
How to get ideas
Computers and Live performance
Transcoding

Grading and projects:

Grades will be assigned projects, several small assignments/quizzes, and class participation. Please see Neil Leonard’s EP-341 syllabus for details. I encourage and will give credit for: collaboration with other students, outside projects, performances, independent projects, and anything else that will encourage your growth and success.

I am open to alternative projects. For example, if you want to use this course as an opportunity to develop a larger project or continue a work in progress.

Reference material

https://cycling74.com/wiki/index.php?title=Max_Documentation_and_Resources

Max documentation, examples, and lessons (accessed from the ‘Help’ menu)
Google (search for: max/msp) http://www.google.com
cycling74.com (community, forums) https://cycling74.com/community/
Youtube videos
- dude837 https://www.youtube.com/user/dude837
- many others
Peter Elsea http://peterelsea.com/maxtutorials.html
Chris Dobrian
CCRMA
“Designing Sound” By Andy Farnell (Pure Data)
Github https://github.com
This website: https://reactivemusic.net
Max externals
- CNMAT http://cnmat.berkeley.edu/library/max_msp_jitter_depot
- “Max/MSP Objects” By Eric Lyon

January 1, 2015January 1, 2015

Vocaloid tutorial

by soundwavescience

http://youtu.be/vcJDTDBWTrw

January 1, 2015June 22, 2015

New musical instruments

A presentation for Berklee BTOT 2015 http://www.berklee.edu/faculty

Around the year 1700, several startup ventures developed prototypes of machines with thousands of moving parts. After 30 years of engineering, competition, and refinement, the result was a device remarkably similar to the modern piano.

What are the musical instruments of the future being designed right now?

new composition tools,
reactive music,
connecting things,
sensors,
voices,
brains

Notes:

predictions?

Ray Kurzweil’s future predictions on a timeline: http://imgur.com/quKXllo (The Singularity will happen in 2045)

In 1965 researcher Herbert Simon said: “Machines will be capable, within twenty years, of doing any work a man can do”. Marvin Minsky added his own prediction: “Within a generation … the problem of creating ‘artificial intelligence’ will substantially be solved.” https://forums.opensuse.org/showthread.php/390217-Will-computers-or-machines-ever-become-self-aware-or-evolve/page2

Patterns

Are there patterns in the ways that artists adapt technology?

For example, the Hammond organ borrowed ideas developed for radios. Recorded music is produced with computers that were originally as business machines.

Instead of looking forward to predict future music, lets look backwards to ask,”What technology needs to happen to make musical instruments possible?” The piano relies upon a single-escapement (1710) and later a double-escapement (1821). Real time pitch shifting depends on Fourier transforms (1822) and fast computers (~1980).

Artists often find new (unintended) uses for tools. Like the printing press.

New pianos

The piano is still in development. In December 2014, Eren Başbuğ composed and performed music on the Roli Seaboard – a piano keyboard made of 3 dimensional sensing foam:

Here is Keith McMillen’s QuNexus keyboard (with Polyphonic aftertouch):

https://www.youtube.com/watch?v=bry_62fVB1E

Experiments

Here are tools that might lead to new ways of making music. They won’t replace old ways. Singing has outlasted every other kind of music.

These ideas represent a combination of engineering and art. Engineers need artists. Artists need engineers. Interesting things happen at the confluence of streams.

Analysis, re-synthesis, transformation

Computers can analyze the audio spectrum in real time. Sounds can be transformed and re-synthesized with near zero latency.

Infinite Jukebox

Finding alternate routes through a song.

by Paul Lamere at the Echonest

Echonest has compiled data on over 14 million songs. This is an example of machine learning and pattern matching applied to music.

http://labs.echonest.com/Uploader/index.html

Try examples: “Karma Police”, Or search for: “Albert Ayler”)

Analyze your own music: https://reactivemusic.net/?p=18026

Remixing a remix

“Mindblowing Six Song Country Mashup”: https://www.youtube.com/watch?v=FY8SwIvxj8o (start at 0:40)

Local file: Max teaching examples/new-country-mashup.mp3

More about Echonest

Music Machinery by Paul Lamere: http://musicmachinery.com
Echonest segment analysis player: https://reactivemusic.net/?p=6296

Feature detection

Looking at music under a microscope.

removing music from speech

First you have to separate them.

SMS-tools

by Xavier Serra and UPF

Harmonic Model Plus Residual (HPR) – Build a spectrogram using STFT, then identify where there is strong correlation to a tonal harmonic structure (music). This is the harmonic model of the sound. Subtract it from the original spectrogram to get the residual (noise).

Settings for above example:

Window size: 1800 (SR / f0 * lobeWidth) 44100 / 200 * 8 = 1764
FFT size: 2048
Mag threshold: -90
Max harmonics: 30
f0 min: 150
f0 max: 200

Many kinds of features

Low level features: harmonicity, amplitude, fundamental frequency
high level features: mood, genre, danceability

Examples of feature detection

Acoustic Brainz: https://reactivemusic.net/?p=17641 (typical analysis page)
Freesound (vast library of sounds): https://www.freesound.org – look at “similar sounds”
Essentia (open source feature detection tools) https://github.com/MTG/essentia
“What We Watch” – Ethan Zuckerman https://reactivemusic.net/?p=10987

Music information retrieval

Finding the drop

“Detetcting Drops in EDM” – by Karthik Yadati, Martha Larson, Cynthia C. S. Liem, Alan Hanjalic at Delft University of Technology (2014) https://reactivemusic.net/?p=17711

Polyphonic audio editing

Blurring the distinction between recorded and written music.

Melodyne

by Celemony

http://www.celemony.com/en/start

A minor version of “Bohemian Rhapsody”: http://www.youtube.com/watch?v=voca1OyQdKk

Music recognition

“How Shazam Works” by Farhoud Manjoo at Slate: https://reactivemusic.net/?p=12712, “About 3 datapoints per second, per song.”

Music fingerprinting: https://musicbrainz.org/doc/Fingerprinting
Humans being computers. Mystery sounds. (Local file: Desktop/mystery sounds)
Is it more difficult to build a robot that plays or one that listens?

Sonographic sound processing

Transforming music through pictures.

by Tadej Droljc

https://reactivemusic.net/?p=16887

(Example of 3d speech processing at 4:12)

local file: SSP-dissertation/4 – Max/MSP/Jitter Patch of PV With Spectrogram as a Spectral Data Storage and User Interface/basic_patch.maxpat

Try recording a short passage, then set bound mode to 4, and click autorotate

Spectral scanning in Ableton Live:

http://youtu.be/r-ZpwGgkGFI

Web Audio

Web browser is the new black

Noteflight

by Joe Berkowitz

http://www.noteflight.com/login

Plink

by Dinahmoe

http://labs.dinahmoe.com/plink/

Can you jam over the internet?

What is the speed of electricity? 70-80 ms is the best round trip latency (via fiber) from the U.S. east to west coast. If you were jamming over the internet with someone on the opposite coast it might be like being 100 ft away from them in a field. (sound travels 1100 feet/second in air).

Global communal experiences – Bill McKibben – 1990 “The Age of Missing Information”

More about Web Audio

A quick Web Audio introduction: https://reactivemusic.net/?p=17600
Gibber by Charlie Roberts http://gibber.mat.ucsb.edu/

Conversation with robots

Computers finding meaning

The Google speech API

https://reactivemusic.net/?p=9834

The Google speech API uses neural networks, statistics, and large quantities of data.

Microsoft: real-time translation

German/English http://digg.com/video/heres-microsoft-demoing-their-breakthrough-in-real-time-translated-conversation
Skype translator – Spanish/English: http://www.skype.com/en/translator-preview/

Reverse entropy

InstantDecomposer

Making music from from sounds that are not music.

by Katja Vetter

. (InstantDecomposer is an update of SliceJockey2): http://www.katjaas.nl/slicejockey/slicejockey.html

local: InstantDecomposer version: tkzic/pdweekend2014/IDecTouch/IDecTouch.pd
local: slicejockey2test2/slicejockey2test2.pd

More about reactive music

RJDJ apps – create personal soundtracks from the environment
“Lyrebirds” by Christopher Lopez https://www.youtube.com/watch?v=Ouws45R2iXg

Sensors and sonification

Transforming motion into music

Three approaches

earcons (email notification sound)
models (video game sounds)
parameter mapping (Geiger counter)

Leap Motion

camera based hand sensor

“Muse” (Boulanger Labs) with Paul Bachelor, Christopher Konopka, Tom Shani, and Chelsea Southard: https://reactivemusic.net/?p=16187

Max/MSP piano example: Leapfinger: https://reactivemusic.net/?p=11727

local file: max-projects/leap-motion/leapfinger2.maxpat

Internet sensors project

Detecting motion from the Internet

https://reactivemusic.net/?p=5859

Twitter streaming example

https://reactivemusic.net/?p=5786

MBTA bus data

Sonification of Mass Ave buses, from Harvard to Dudley

https://reactivemusic.net/?p=17524

Stock market music

https://reactivemusic.net/?p=12029

More sonification projects

Vine API mashup

By Steve Hensley

Using Max/MSP/jitter

local file: tkzic/stevehensely/shensley_maxvine.maxpat

Audio sensing gloves for spacesuits

By Christopher Konopka at future, music, technology

http://futuremusictechnology.com

Computer Vision

Sensing motion with video using frame subtraction

by Adam Rokhsar

https://reactivemusic.net/?p=7005

local file: max-projects/frame-subtraction

The brain

Music is stored all across the brain.

Mouse brain wiring diagram

The Allen institute

https://reactivemusic.net/?p=17758

“Hacking the soul” by Christof Koch at the Allen institute

(An Explanation of the wiring diagram of the mouse brain – at 13:33) http://www.technologyreview.com/emtech/14/video/watch/christof-koch-hacking-the-soul/

OpenWorm project

A complete simulation of the nematode worm, in software, with a Lego body (320 neurons)

: https://reactivemusic.net/?p=17744

AARON

Harold Cohen’s algorithmic painting machine

https://reactivemusic.net/?p=17778

Brain plasticity

A perfect pitch pill? http://www.theverge.com/2014/1/6/5279182/valproate-may-give-humans-perfect-pitch-by-resetting-critical-periods-in-brain

DNA

Could we grow music producing organisms? https://reactivemusic.net/?p=18018

Two possibilities

Rejecting technology?

An optimistic future?

There is a quickening of discovery: internet collaboration, open source, linux, github, r-pi, Pd, SDR.

“Robots and AI will help us create more jobs for humans — if we want them. And one of those jobs for us will be to keep inventing new jobs for the AIs and robots to take from us. We think of a new job we want, we do it for a while, then we teach robots how to do it. Then we make up something else.”

“…We invented machines to take x-rays, then we invented x-ray diagnostic technicians which farmers 200 years ago would have not believed could be a job, and now we are giving those jobs to robot AIs.”

Kevin Kelly – January 7, 2015, reddit AMA http://www.reddit.com/r/Futurology/comments/2rohmk/i_am_kevin_kelly_radical_technooptimist_digital/

Will people be marrying robots in 2050? http://www.livescience.com/1951-forecast-sex-marriage-robots-2050.html

“What can you predict about the future of music” by Michael Gonchar at The New York Times https://reactivemusic.net/?p=17023

Jim Morrison predicts the future of music:

More areas to explore

NIME (New interfaces for musical expression) http://en.wikipedia.org/wiki/New_Interfaces_for_Musical_Expression
Immersive virtual musical instruments http://en.wikipedia.org/wiki/Immersive_virtual_musical_instrument
I’m thinking of something: http://imthinkingofsomething.com

January 1, 2015January 15, 2015

Hearing voices

A presentation for Berklee BTOT 2015 http://www.berklee.edu/faculty

(KITT dashboard by Dave Metlesits)

The voice was the first musical instrument. Humans are not the only source of musical voices. Machines have voices. Animals too.

Topics

synthesizing voices (formant synthesis, text to speech, Vocaloid)
processing voices (pitch-shifting, time-stretching, vocoding, filtering, harmonizing),
voices of the natural world
fictional languages and animals
accents
speech and music recognition
processing voices as pictures
removing music from speech
removing voices

Voices

We instantly recognize people and animals by their voices. As an artist we work to develop our own voice. Voices contain information beyond words. Think of R2D2 or Chewbacca.

There is also information between words: “Palin Biden Silences” David Tinapple, 2008: http://vimeo.com/38876967

Synthesizing voices

The vocal spectrum

What’s in a voice?

Formant synthesis in Max by Mark Durham: https://reactivemusic.net/?p=9294 (singing vowels with formants)
Formant synthesis Tutorial by Jordan Smith: https://reactivemusic.net/?p=9290 (making consonants with noise)

Singing chords

Humans acting like synthesizers.

Singing chords: Lalah Hathaway https://www.youtube.com/watch?v=c5AdOZtRdfE (0:30)
Tuvan throat singing: https://www.youtube.com/watch?v=5wHbIWH_NGc (near the end of the video)
Polyphonic overtone singing: Anna-Maria Hefele https://www.youtube.com/watch?v=vC9Qh709gas

More about formants

Formants (Wikipedia) http://en.wikipedia.org/wiki/Formant
Rooms have resonances: “I am sitting in a Room” by Alvin Lucier
Singer’s formant (2800-3400Hz).

Text to speech

Teaching machines to talk.

phonemes (unit of sound)
diphones (combination of phonemes) (Mac OS “Macintalk 3 pro”)
morphemes (unit of meaning)
prosody (musical quality of speech)

Methods

articulatory (anatomical model)
formant (additive synthesis) (speak and spell)
concatentative (building blocks) (Mac Os)

Try the ‘say’ command (in Mac OS terminal), for example: say hello

More about text to speech

History of speech synthesis http://research.spa.aalto.fi/publications/theses/lemmetty_mst/chap5.html (Helsinki University of Technology 1999)
Speech synthesizers, 2014 https://reactivemusic.net/?p=18141
Speech synthesis web API https://reactivemusic.net/?p=18138

Vocoders

Combining the energy of voice with musical instruments (convolution)

Peter Frampton “talkbox”: https://www.youtube.com/watch?v=EqYDQPN_nXQ (about 5:42) – Where is the exciting audience noise in this video?
Ableton Live example: Local file: Max/MSP: examples/effects/classic-vocoder-folder/classic_vocoder.maxpat
Max vocoder tutorial (In the frequency domain), by dude837 – Sam Tarakajian https://reactivemusic.net/?p=17362 (local file: dude837/4-vocoder/robot-master.maxpat

More about vocoders

How vocoders work, by Craig Anderton: https://reactivemusic.net/?p=17218
Wikipedia: http://en.wikipedia.org/wiki/Vocoder. Engineers conserving information to reduce bandwith
Heterodyne filter: https://reactivemusic.net/?p=17338 – digital emulation of an analog filter bank.
Max/MSP: examples/effects/classic-vocoder-folder/classic_vocoder.maxpat

Vocaloid

By Yamaha

(text + notation = singing)

Vocaloid website: http://www.vocaloid.com/en/
Hatsune Miku: https://reactivemusic.net/?p=6891

Demo tracks: https://www.youtube.com/watch?v=QWkHypp3kuQ

Vocaloid tutorial
- #1 https://www.youtube.com/watch?v=vcJDTDBWTrw (entering notes and lyrics – 1:25)
- #2 https://www.youtube.com/watch?v=qpGwgIyMGOk (raw sound – 0:42)
- #5 https://www.youtube.com/watch?v=YEAuL6Q2j-0 (with phrasing, vibrato, etc.,- 1:00)

Vocaloop device http://vocaloop.jp/ demo: https://www.youtube.com/watch?v=xLpX2M7I6og#t=24

Processing voices

Transformation

Pitch transposing a baby https://reactivemusic.net/?p=2458

Real time pitch shifting

Autotune: “T-Pain effect” ,(I-am-T-Pain bySmule), “Lollipop” by Lil’ Wayne. “Woods” by Bon Iver https://www.youtube.com/watch?v=1_cePGP6lbU

Autotuna in Max 7

by Matthew Davidson

Local file: max-teaching-examples/autotuna-test.maxpat

InstantDecomposer in Pure Data (Pd)

by Katja Vetter

http://www.katjaas.nl/slicejockey/slicejockey.html

Autocorrelation: (helmholtz~ Pd external) “Helmholtz finds the pitch” http://www.katjaas.nl/helmholtz/helmholtz.html

(^^ is input pitch, preset #9 is normal)

local file: InstantDecomposer version: tkzic/pdweekend2014/IDecTouch/IDecTouch.pd
local file: slicejockey2test2/slicejockey2test2.pd

Phasors and Granular synthesis

Disassembling time into very small pieces

sorting noise; http://youtu.be/kPRA0W1kECg
Phasors: https://reactivemusic.net/?p=17353

Time-stretching

Adapted from Andy Farnell, “Designing Sound”

https://reactivemusic.net/?p=11385 Download these patches from: https://github.com/tkzic/max-projects folder: granular-timestretch

Basic granular synthesis: graintest3.maxpat
Time-stretching: timestretch5.maxpat

More about phasors and granular synthesis

Shepard tone upward glissando by Chris Dobrian: https://reactivemusic.net/?p=17255
“Falling Falling” (Visual Shepard tone) https://reactivemusic.net/?p=17251
Ableton Live – granulator (Robert Henke)

Phase vocoder

…coming soon

Sonographic sound processing

Changing sound into pictures and back into sound

by Tadej Droljc

https://reactivemusic.net/?p=16887

(Example of 3d speech processing at 4:12)

local file: SSP-dissertation/4 – Max/MSP/Jitter Patch of PV With Spectrogram as a Spectral Data Storage and User Interface/basic_patch.maxpat

Try recording a short passage, then set bound mode to 4, and click autorotate

Speech to text

Understanding the meaning of speech

The Google Speech API

A conversation with a robot in Max

https://reactivemusic.net/?p=9834

Google speech uses neural networks, statistics, and large quantities of data.

More about speech to text

Real time German/English translator (Microsoft) http://digg.com/video/heres-microsoft-demoing-their-breakthrough-in-real-time-translated-conversation
Skype translator – Spanish/English: http://www.skype.com/en/translator-preview/
Dragon Naturally Speaking (Nuance) accidentally converts music to poetry

Voices of the natural world

Changes in the environment reflected by sound

Bernie Krause: “Soundscapes”
- The Voice of The Natural World: http://blog.ted.com/2013/06/12/the-voice-of-the-natural-world-bernie-krause-at-tedglobal-2013/
- TED: http://www.ted.com/talks/bernie_krause_the_voice_of_the_natural_world

Fictional languages and animals

“You can talk to the animals…”

Derek Abbot’s animal noise page: http://www.eleceng.adelaide.edu.au/Personal/dabbott/animal.html
Quack project http://www.quack-project.com/table.cgi
Fictional language dialog by Naila Burney: https://reactivemusic.net/?p=7242

Pig creatures example: http://vimeo.com/64543087

0:00 Neutral
0:32 Single morphemes – neutral mode
0:37 Series, with unifying sounds and breaths
1:02 Neutral, layered
1:12 Sad
1:26 Angry
1:44 More Angry
2:11 Happy

What about Jar Jar Binks?

Accents

The sound changes but the words remain the same.

The Speech accent archive https://reactivemusic.net/?p=9436

Finding and removing music in speech

We are always singing.

Jamming with speech

Drummer jams with a speed-talking auctioneer: https://reactivemusic.net/?p=7140
Guitarist imitates crying politician: http://digg.com/video/guitarist-plays-along-to-sobbing-japanese-politician

Removing music from speech

SMS-tools

by Xavier Serra and UPF

Settings for above example:

Window size: 1800 (SR / f0 * lobeWidth) 44100 / 200 * 8 = 1764
FFT size: 2048
Mag threshold: -90
Max harmonics: 30
f0 min: 150
f0 max: 200

feature detection

time dependent
Low level features: harmonicity, amplitude, fundamental frequency
high level features: mood, genre, danceability

Acoustic Brainz: (typical analysis page) https://reactivemusic.net/?p=17641

Essentia (open source feature detection tools) https://github.com/MTG/essentia

Freesound (vast library of sounds): https://www.freesound.org – look at “similar sounds”

Removing voices from music

A sad thought

phase cancellation encryption

This method was used to send secret messages during world war 2. Its now used in cell phones to get rid of echo. Its also used in noise canceling headphones.

https://reactivemusic.net/?p=8879

max-projects/phase-cancellation/phase-cancellation-example.maxpat

Center channel subtraction

What is not left and not right?

Ableton Live – utility/difference device: https://reactivemusic.net/?p=1498 (Allison Krause example)

Local file: Ableton-teaching-examples/vocal-eliminator

More experiments

Synthesizing laughter
Bobby McFerrin: (pentatonic scale) http://www.ted.com/talks/bobby_mcferrin_hacks_your_brain_with_music.html
Alphabet vocals
- jii lighter https://reactivemusic.net/?p=6970
- Sesame St – Joan LaBarbara: http://www.youtube.com/watch?v=y819U6jBDog
Warping acapella tracks https://reactivemusic.net/?p=18046

Questions

Why do most people not like the recorded sound of their voice?
Can voice be used as a controller?
- (Imitone: http://imitone.com)
- Mari Kimura
How do you recognize voices?
Does speech recognition work with singing?
How does the Google Speech API know the difference between music and speech?
How can we listen to ultrasonic animal sounds?
What about animal translators?

December 15, 2014December 15, 2014

ep-413 DSP week 15

Review

Syllabus: https://reactivemusic.net/?p=17122
Ways to approach a project https://reactivemusic.net/?p=17132
- Make machines that make art
- Reverse engineering
- Use the wrong tools
- Abstraction and destruction
- Backwards, extreme, opposite – connect two things
- Ask questions
Composition tools and dramatic shape https://reactivemusic.net/?p=17157
Problem solving (pitch detection) and prototyping (Muse) https://reactivemusic.net/?p=17159
Sound byte composition https://reactivemusic.net/?p=17190
Convolution and voices https://reactivemusic.net/?p=17211
(No class this week)
Granular synthesis, the frequency domain, and phasors https://reactivemusic.net/?p=17360
Data, Internet API’s, Vine API in Max https://reactivemusic.net/?p=17466
Communication, Osc, Sonification, MBTA API in Max https://reactivemusic.net/?p=17518
Filters: analog, digital, other, reversability https://reactivemusic.net/?p=17542
Web Audio API https://reactivemusic.net/?p=17600
Feature detection, and Music Information Retrieval https://reactivemusic.net/?p=17689
Waves: light, radio, water https://reactivemusic.net/?p=17787
This

Ideas

John Coltrane: You can learn something from everybody, no matter how good or bad they play, everybody has something to say.

Sal Khan: In the future people will take agency for their own education.

For artists, everything is a tool.

November 29, 2014December 11, 2014

ep-413 DSP – week 13

Feature detection and randomness

What could it possibly have to do with my life?

Feature detection

descriptors

Spectral
- BarkBands, MelBands, ERBBands, MFCC, GFCC, LPC, HFC, Spectral Contrast, Inharmonicity, and Dissonance, …
Time-domain
- EffectiveDuration, ECR, Loundness, …
Tonal
- PitchSalienceFunction, PitchYinFFT, HPCP, TuningFrequency, Key, ChordsDetection, …
Rhythm
- BestTrackerDegara, BeatTrackerMultiFeature, BPMHistogramDescriptors, NoveltyCurve, OnsetDescription, OnsetDetection, Onsets, …
SFX
- LogAttackTime, MaxToTotal, MinToTotal, TCToTotal, …
High-level
- Danceability, DynamicComplexity, FadeDetection, SBic, …

-from X. Serra (2014) “Audio Signal Processing for Music Applications”

low level vs. high level
single events vs. groups of events
combinations of descriptors
order of events (markov chains)

Humans are very good at pattern recognition. Is it a survival mechanism? People who listen to music are very good at analysis. Compared to the abilities of an average child, computer music information retrieval has not yet reached the computational ability of a worm: https://reactivemusic.net/?p=17744

Pattern recognition

(in computer science)

search engines
(search by image) https://reactivemusic.net/?p=9847
translators
warning systems
dolphin whistle https://reactivemusic.net/?p=10337
machine learning – facial recognition: https://reactivemusic.net/?p=17717

Music information retrieval

High-level

chills: https://reactivemusic.net/?p=17714
the drop https://reactivemusic.net/?p=17711
mood: https://reactivemusic.net/?p=17722
Bobby McFerrin https://reactivemusic.net/?p=555

Low-level

(These demonstrations will be done in Ubuntu Linux 14.04.1)

SMS tools https://reactivemusic.net/?p=17626
Essentia https://reactivemusic.net/?p=17656

1. Separating and removing musical tones from speech

1a. Harmonic plus residual model (HPR) in sms-tools (speech-female 150-200 Hz. , and sax phrase)

1b. Do the same thing with transformation model

What about pitch salience and chroma?

2. descriptors

(use workspace/A9)

import soundAnalysis as SA

Here is the list of descriptors that are donwloaded:

Index — Descriptor

0 — lowlevel.spectral_centroid.mean

1 — lowlevel.dissonance.mean

2 — lowlevel.hfc.mean

3 — sfx.logattacktime.mean

4 — sfx.inharmonicity.mean

5 — lowlevel.spectral_contrast.mean.0

6 — lowlevel.spectral_contrast.mean.1

7 — lowlevel.spectral_contrast.mean.2

8 — lowlevel.spectral_contrast.mean.3

9 — lowlevel.spectral_contrast.mean.4

10 — lowlevel.spectral_contrast.mean.5

11 — lowlevel.mfcc.mean.0

12 — lowlevel.mfcc.mean.1

13 — lowlevel.mfcc.mean.2

14 — lowlevel.mfcc.mean.3

15 — lowlevel.mfcc.mean.4

16 — lowlevel.mfcc.mean.5

2a. euclidian distance

What happens when you look at multiple descriptors

In [22]: SA.descriptorPairScatterPlot( ‘tmp’, descInput=(0,5))

2b. k- means clustering

What descriptors best classify sounds into instrument groups?

SA.clusterSounds(‘tmp’, nCluster = 3, descInput=[0,2,9])

In [21]: SA.clusterSounds(‘tmp’, nCluster = 3, descInput=[0,2,9])
(Cluster: 0) Using majority voting as a criterion this cluster belongs to class: violin
Number of sounds in this cluster are: 15
sound-id, sound-class, classification decision
[[‘61926’ ‘violin’ ‘1’]
[‘61925’ ‘violin’ ‘1’]
[‘153607’ ‘violin’ ‘1’]
[‘153629’ ‘violin’ ‘1’]
[‘153609’ ‘violin’ ‘1’]
[‘153608’ ‘violin’ ‘1’]
[‘153628’ ‘violin’ ‘1’]
[‘153603’ ‘violin’ ‘1’]
[‘153602’ ‘violin’ ‘1’]
[‘153601’ ‘violin’ ‘1’]
[‘153600’ ‘violin’ ‘1’]
[‘153610’ ‘violin’ ‘1’]
[‘153606’ ‘violin’ ‘1’]
[‘153605’ ‘violin’ ‘1’]
[‘153604’ ‘violin’ ‘1’]]

(Cluster: 1) Using majority voting as a criterion this cluster belongs to class: bassoon
Number of sounds in this cluster are: 22
sound-id, sound-class, classification decision
[[‘154336’ ‘bassoon’ ‘1’]
[‘154337’ ‘bassoon’ ‘1’]
[‘154335’ ‘bassoon’ ‘1’]
[‘154352’ ‘bassoon’ ‘1’]
[‘154344’ ‘bassoon’ ‘1’]
[‘154338’ ‘bassoon’ ‘1’]
[‘154339’ ‘bassoon’ ‘1’]
[‘154343’ ‘bassoon’ ‘1’]
[‘154342’ ‘bassoon’ ‘1’]
[‘154341’ ‘bassoon’ ‘1’]
[‘154340’ ‘bassoon’ ‘1’]
[‘154347’ ‘bassoon’ ‘1’]
[‘154346’ ‘bassoon’ ‘1’]
[‘154345’ ‘bassoon’ ‘1’]
[‘154353’ ‘bassoon’ ‘1’]
[‘154350’ ‘bassoon’ ‘1’]
[‘154349’ ‘bassoon’ ‘1’]
[‘154348’ ‘bassoon’ ‘1’]
[‘154351’ ‘bassoon’ ‘1’]
[‘61927’ ‘violin’ ‘0’]
[‘61928’ ‘violin’ ‘0’]
[‘153769’ ‘cello’ ‘0’]]

(Cluster: 2) Using majority voting as a criterion this cluster belongs to class: cello
Number of sounds in this cluster are: 23
sound-id, sound-class, classification decision
[[‘154334’ ‘bassoon’ ‘0’]
[‘61929’ ‘violin’ ‘0’]
[‘61930’ ‘violin’ ‘0’]
[‘153626’ ‘violin’ ‘0’]
[‘42252’ ‘cello’ ‘1’]
[‘42250’ ‘cello’ ‘1’]
[‘42251’ ‘cello’ ‘1’]
[‘42256’ ‘cello’ ‘1’]
[‘42257’ ‘cello’ ‘1’]
[‘42254’ ‘cello’ ‘1’]
[‘42255’ ‘cello’ ‘1’]
[‘42249’ ‘cello’ ‘1’]
[‘42248’ ‘cello’ ‘1’]
[‘42247’ ‘cello’ ‘1’]
[‘42246’ ‘cello’ ‘1’]
[‘42239’ ‘cello’ ‘1’]
[‘42260’ ‘cello’ ‘1’]
[‘42241’ ‘cello’ ‘1’]
[‘42243’ ‘cello’ ‘1’]
[‘42242’ ‘cello’ ‘1’]
[‘42253’ ‘cello’ ‘1’]
[‘42244’ ‘cello’ ‘1’]
[‘42259’ ‘cello’ ‘1’]]
Out of 60 sounds, 7 sounds are incorrectly classified considering that one cluster should ideally contain sounds from only a single class
You obtain a classification (based on obtained clusters and majority voting) accuracy of 88.33 percentage

2c. KNN find nearest neighbors

Which neighborhood or group does a sound belong to?

(using bad male vocal)

SA.classifySoundkNN(“qs/175454/175454/175454_2042115-lq.json”, “tmp”,13, descInput = [0,5,10])
===

from workspace…

First test: Use sounds by querying “saxophone”, tag=”alto-sax”
https://www.freesound.org/people/clruwe/sounds/119248/
I am using the same descriptors [0,2,9] that worked well in previous section. with K=3. Tried various values of K with this analysis and it always came out matching ‘violin’ which I think is correct.

In [26]: SA.classifySoundkNN(“qs/saxophone/119248/119248_2104336-lq.json”, “tmp”, 33, descInput = [0,2,9])
This sample belongs to class: violin
Out[26]: ‘violin’

Second test: I am trying out the freesound “similar sound” feature. Using one of the bassoon sounds I clicked “similar sounds” and chose a sound that was not a bassoon – “Bad Singer” (male).

http://freesound.org/people/sergeeo/sounds/175454/

Running the previous descriptors returned a match for violin. So I tried various other descriptors, and was able to get it to match bassoon consistently by using: [0,5,10] which are lowlevel.spectral_centroid.mean, lowlevel.spectral_contrast.mean.0, and lowlevel.mfcc.mean.0.

I honestly don’t know the best strategy for choosing these descriptors and tried to go with ones that seemed the least esoteric. The value of K does not seem to make any difference in the classification.

Here is the output

In [42]: SA.classifySoundkNN(“qs/175454/175454/175454_2042115-lq.json”, “tmp”,13, descInput = [0,5,10])
This sample belongs to class: bassoon
Out[42]: ‘bassoon’

2d. JSON analysis data for “bad voice” example…

#: cd ~/sms-tools/workspace/A9/qs/175454/175454

# cat 175454_2042115-lq.json | python -mjson.tool

3. acousticbrainz

(typical analysis page…) https://reactivemusic.net/?p=17641

4. Echonest acoustic analysis

fingerprinting and human factors like danceability…

specification document: https://reactivemusic.net/?p=6276
Max example: https://reactivemusic.net/?p=6296
en_analyzer~ object https://reactivemusic.net/?p=6282
Tristan Jehan’s Max/MSP externals http://web.media.mit.edu/~tristan/maxmsp.html

Randomness

continuum

random -> ordered (stochastic -> deterministic (for academics))

Does technology have a sound?

Can you add small bits of randomness?

Can you detect randomness?
How much repetition is enough?

Brain wiring diagram: https://reactivemusic.net/?p=17758

Christof Koch – check out this video at around 13:33 for about 2 minutes http://www.technologyreview.com/emtech/14/video/watch/christof-koch-hacking-the-soul/

With every technology, musicians figure out how to use it another way. Starting with stone tools, bow and arrow, and now computers.

Data science skills: https://reactivemusic.net/?p=17707

Car engine synthesizer

At the end of the class we revved the engine of a 2015 Golf TSI, connected to an engine sound simulator in Max, on Boylston street…

https://reactivemusic.net/?p=7643

November 23, 2014November 29, 2014

ep-413 DSP – week 12

Web Audio API

A good place to start

There are many links to Web Audio project right here on this blog: https://reactivemusic.net/?tag=web-audio

Technical reference manual

Comprehensive guide from Mozilla Developer Network https://developer.mozilla.org/en-US/docs/Web/API/Web_Audio_API

Quick tutorial

Open a javascript console in a Web browser. This example uses Chrome. From the Chrome menu, select View | Developer | Javascript console
Type the following commands (in the shaded blocks) into the console:

Assign an instance of the Web Audio class to the object: context

context = new AudioContext();

Make an oscillator node

osc = context.createOscillator();

Connect the oscillator to the audio output (speaker)

osc.connect(context.destination);

Turn on the oscillator at the current time and plays a tone.

osc.noteOn(0);

Turn of the oscillator

osc.noteOff(0);

For more basics, see: “Using the Web Audio API”, at the Mozilla Developer Network: https://developer.mozilla.org/en-US/docs/Web/API/Web_Audio_API/Using_Web_Audio_API

Basic html examples:

Demonstrated in class. Download from here: https://github.com/tkzic/web-audio-projects

Oscillator
Audio file player

HTML versions of examples from Boris Smus’s book: https://reactivemusic.net/?p=6094

Examples

Plink by Dinahmoe http://labs.dinahmoe.com/plink/
Web Audio Toy: https://reactivemusic.net/?p=17578
Sample demos: https://chromium.googlecode.com/svn/trunk/samples/audio/samples.html
javascript motion detection http://www.soundstep.com/blog/experiments/jsdetection/ ( broken)
infinite jukebox by Paul Lamere http://labs.echonest.com/Uploader/index.html
WebGL city http://alteredqualia.com/three/examples/webgl_city.html
Ring Visualizer http://airtightinteractive.com/demos/js/reactive/
Wave-pd1 (iOS) http://alxgbsn.co.uk/wavepad/
Granular synthesizers –
- http://chromium.googlecode.com/svn/trunk/samples/audio/granular.html (broken)
- http://www.chromeexperiments.com/detail/granular-synthesiser/?f=mobile
webRTC – https://reactivemusic.net/?p=5174
Chrome Experiments
- webcam music by Karen Peng: http://www.chromeexperiments.com/detail/music/?f=
- Sound Viz by Bartek Drozdz: http://www.chromeexperiments.com/detail/soundviz/?f=
- Noisee by Karen Peng: http://www.chromeexperiments.com/detail/noisee/?f=

Tutorials and references

Basic tutorial: http://middleearmedia.com/web-audio-api-basics/
“The Web Audio API” by Boris Smus: https://github.com/borismus/webaudioapi.com (github repository)
html5rocks – tutorials: http://updates.html5rocks.com/2012/01/Web-Audio-FAQ
Web Audio API getting started (creative.js) http://creativejs.com/resources/web-audio-api-getting-started/

Development (libraries, frameworks)

Gibber by Charie Roberts: http://gibber.mat.ucsb.edu/ (for demos, select all code, press <ctrl>-Enter to start, press <ctrl>-. to stop) (updated but check github)
WebPd – Sebastien Piquemal – https://github.com/sebpiq/WebPd/tree/develop
gibberface by Charlie Roberts – https://reactivemusic.net/?p=16910
5 Web Audio libraries: https://reactivemusic.net/?p=16902
flocking by Colin Clark https://github.com/colinbdclark/Flocking
p5js: http://p5js.org/libraries/

tz – examples

Web Audio API – Web Audio Playground, Google API, Osc, Web Sockets, Max/MSP https://reactivemusic.net/?p=6243
Hardware control (touchOSC) of Web Audio API https://reactivemusic.net/?p=6142
Controlling Web Audio with Max (video) https://reactivemusic.net/?p=6212
Controlling Web Audio with Max https://reactivemusic.net/?p=6193
Programming a Sine wave in Web Audio (flocking by Colin Clark) https://reactivemusic.net/?p=6574
Simple Web Audio input (microphone) example: https://reactivemusic.net/?p=3886

Assignment

Write a composition to induce magical effects.

Here is an example from Aseem Suri http://www.aseemsuri.com/journal/piece-of-mind-second-run-at-the-csound-conference

The project was derived from computer technology, but the overall effect was that people would go into a mysterious room, for a minute, and when they emerged, they would be smiling and happy.

Due on December 15th (last class)

November 16, 2014November 23, 2014

ep-413 DSP – week 11

Filters

Analog

Guitar tone control
impedance matching http://www.learnabout-electronics.org/ac_theory/transformers04.php
resistors, capacitor RC circuits http://www.electronics-tutorials.ws/filter/filter_2.html
capacitor calculation: http://www.convertunits.com/from/farad/to/picofarad
identifying capactiors: http://grathio.com/assets/capacitor_tags.pdf
Wolfram Alpha: http://www.wolframalpha.com/input/?i=1+%2F+%28.0000000047+*+10000+*+2+*+3.1415%29

Digital

Max filter tutorial: https://cycling74.com/wiki/index.php?title=MSP_Filter_Tutorial_1:_Simple_Filters
- cutoff frequency, stopband, passband, bandwidth, q, resonance, poles, slope, phase shift, delay, order,
- impulse, impulse response, fir, iir
Links to all Max filter tutorials: https://cycling74.com/wiki/index.php?title=Special%3ASearch&profile=default&search=filter+tutorial&fulltext=Search
Filter modulation and envelopes
the frequency domain: noise gates, parametric eq
- forbidden planet (max example)
- fft filter by Katja Vetter https://reactivemusic.net/?p=11207
sinusoidal, harmonic, stochastic models

Other

We usually think of filters in terms of frequency. Any process that removes information is a filter. Curation or abstraction, for example.

multirate DSP: upsampling, downsampling, decimation and interpolation https://reactivemusic.net/?p=11392
slew-limiter: https://reactivemusic.net/?p=2506
randomness, entropy
heterodyne filter: https://reactivemusic.net/?p=17338
nothing-detector: https://reactivemusic.net/?p=11770

Reversability

encryption – Markus Brandau – http://upcommons.upc.edu/pfc/bitstream/2099.1/4858/1/MarkusBrandau.pdf
various methods: https://reactivemusic.net/?p=7752
Fiber optic noise cancellation: https://reactivemusic.net/?p=6680
Noise cancellation for light: https://reactivemusic.net/?p=6381

language translation http://richard-blanco.com/book/city-of-a-hundred-fires/mother-picking-produce/

examples

Simple phase cancellation (disappearing-act.maxpat) https://reactivemusic.net/?p=8879
Encryption by masking and phase cancellation: (phase-cancellation-example.maxpat)
2 methods of reversible audio encryption in Max: reciprocal and frequency shifting: https://reactivemusic.net/?p=9030 (voice-encryption2b.maxpat)
FM, AM, and SSB modulation/demodulation in Max https://reactivemusic.net/?p=8885
weird spectral symmetry: https://reactivemusic.net/?p=11747

November 10, 2014November 11, 2014

ep-413 DSP week 10

Music from data

We looked at Vine API Examples from Eli and Steve H.

OSC (Open Sound Control)

Max udpsend, udpreceive, touchOsc, aka.speech
using a wifi router for performance projects

Using an Osc server to handle Internet API’s

ruby, python, php
ruby twitter example in Max https://reactivemusic.net/?p=7013

Searchtweet (rate limited): https://reactivemusic.net/?p=17425

Max oggz streaming example: https://reactivemusic.net/?p=17504

We talked about cartoons: https://reactivemusic.net/?p=10091

Sonification

parameter mapping
modeling
earcons

We didn’t talk about alarm fatigue…

MBTA bus example in Max:

(parameter mapping sonification)

https://reactivemusic.net/?p=17524

documentation
API keys
JSON
Max example

Unsolved problems

Is it possible to generate music from data?

November 2, 2014November 10, 2014

ep-413 DSP week 9

Data.

Building a Max patch that displays, transforms, and responds to internet data.

building materials

Max (6.1.7 or newer)
Soundflower –

Both available from Cycling 74 http://cycling74.com/

The Max patch is based on a tutorial by dude837 called “Automatic Silly Video Generator”

download

The patch at the download link in the video is broken – but the javascript code for the Max js object is intact. You can download the entire patch from the Max-projects archive: https://github.com/tkzic/max-projects folder: maxvine

Internet API’s

API’s (application programming interfaces) provide methods for programs (other than web browsers) to access Internet data. Any app that access data from the web uses an API.

Here is a link to information about the Vine API: https://github.com/starlock/vino/wiki/API-Reference

For example, if you copy this URL into a web browser address bar, it will return a block of data in JSON format about the most popular videos on Vine: https://api.vineapp.com/timelines/popular

HTTP requests

An HTTP request transfers data to or from a server. A web browser handles HTTP requests in the background. You can also write programs that make HTTP requests. A program called “curl” runs http requests from the terminal command line. Here are examples: https://reactivemusic.net/?p=5916

Response data

Data is usually returned in one of 3 formats:

JSON
XML
HTML

JSON is the preferred method because its easy to access the data structure.

Max HTTP requests

There are several ways to make HTTP requests in Max, but the best method is the js object: Here is the code that runs the GET request for the Vine API:

function get(url)
{
    var ajaxreq = new XMLHttpRequest();
    ajaxreq.open("GET", url);
    ajaxreq.onreadystatechange = readystatechange;
    ajaxreq.send();
}

function readystatechange()
{
    var rawtext = this._getResponseKey("body");
    var body = JSON.parse(rawtext);
    outlet(0, body.data.records[0].videoUrl);
}

The function: get() formats and sends an HTTP request using the URL passed in with the get message from Max. When the data is returned to Max, the readystatechange() function parses it and sends the URL of the most popular Vine video out the left outlet of the js object.

Playing Internet audio/video files in Max

The qt.movie object will play videos, with the URL passed in by the read message.

Unfortunately, qt.movie sends its audio to the system, not to Max. You can use Soundflower, or a virtual audio routing app, to get the audio back into Max.

Audio from video

https://reactivemusic.net/?p=12570

Color tracking (histogram analysis: https://reactivemusic.net/?p=12598
frame subtraction
frame subtraction with jit.cv
Cyclops (by Eric SInger)

Video from audio

https://reactivemusic.net/?p=12570

Vizzie https://reactivemusic.net/?p=11777
Amplitude, spectrum, and transient detection.
spigot~ https://www.youtube.com/watch?v=qSOqC3lqEnY

Other Internet API examples in Max

There is a large archive of examples here: Internet sensors: https://reactivemusic.net/?p=5859

We will look at more of these next week. Here is simple Max patch that uses the Soundcloud API: https://reactivemusic.net/?p=17430

Gokce Kinayoglu has written a java external for Max called Searchtweet: http://cycling74.com/toolbox/searchtweet-design-patches-that-respond-to-twitter-posts/

Many API’s require complex authentication, or money, before they will release their data. We will look ways to access these API’s from Max next week.

Aggregators

There are API services that consolidate many API’s into one API. For example:

Temboo https://www.temboo.com/library/
Mashape
IFTTT

Scaling data

Look at the Max tutorial (built in to Max Help) called “Data : data scaling” It contains most of what you need to know to work with streams of data.

Assignment

Using the Vine API patch that we built during the class as a starting point: Build a better app.

Ideas to explore:

Is it possible to run several API requests simultaneously?
Recording? Time expansion? Effects that evolve over time?
Generate music from motion, data, and raw sound?
Make a video respond to your instrument or voice?
Design a better user interface or external controller?
Will this idea work in Max For Live?
How would you make adjustments to the loop length, or synchronize a video to other events?
Make envelopes to change the dynamic shape?
Destruction? Abstraction?
Find or write a Max URL streaming object?
What about using a different API or other data from the Vine API?

This project will be due in 2-3 weeks. But for next week please bring in your work in progress, and we will help solve problems.