By Klemens Torggler
By Klemens Torggler
Using Google speech API and Pandorabots API
(updated 1/21/2024)
all of these changes are local – for now.
replace path to sox with /opt/homebrew/bin/sox in [p call-google-speech]
Also had to write a new python script to convert xml to json. Its in the subfolder /xml2json/xml4json.py
The program came from this link: https://www.geeksforgeeks.org/python-xml-to-json/
Also inside [p call-pandorabots] the path for this python program had to be explicit to the full path on the computer. this will vary depending on your python installation.
Also, note that you must install a dependency with pip:
pip install xmltodict
After all that I was actually able to have a conversation. These bots seem primitive, but loveable, now compared to chatGPT. Guess its time for a new project.
Also the voice selection for speech synth is still not connected
(updated 1/21/2021)
This project is an extension to the speech-to-text project: https://reactivemusic.net/?p=4690 You might want to try running that project first to get the Google speech API running.
sox: sox audio conversion program must be in the computer’s executable file path, ie., /usr/bin – or you can rewrite the [sprintf] input to [aka.shell] with the actual path. In our case we installed sox using Macports. The executable path is /opt/local/bin/sox – which is built into a message object in the subpatcher [call-google-speech]
get sox from: http://sox.sourceforge.net
Need to fix the selection of voices.
Also please see these notes about how to modify the patch with your key – until this gets resolved: https://reactivemusic.net/?p=11035
Resonate: exploring possibilities with sounds and spaces
By Mark Durham at “Sound Design with Max”
http://sounddesignwithmax.blogspot.com/2014/01/creative-convolution-part-1-resonate.html
from various artists
tools that make tools
update 6/2014 – Now part of the Internet sensors projects: https://reactivemusic.net/?p=5859
original post
They can talk with each other… sort of.
Last spring I made a project that lets you talk with chatbots using speech recognition and synthesis. https://reactivemusic.net/?p=4710.
Yesterday I managed to get two instances of this program, running on two computers, using two chatbots, to talk with each other, through the air. Technical issues remain (see below). But there were moments of real interaction.
In the original project, a human pressed button in Max to start and stop recording speech. This has been automated. The program detects and records speech, using audio level sensing. The auto-recording sensor turns on a switch when the level hits a threshold, and turns off after a period of silence. Threshold level and duration of silence can be adjusted by the user. There is also a feedback gate that shuts off auto-record while the computer is converting speech to text, and ‘speaking’ a reply.
Alternative pathways prevent circular collapse.
By Paul Lamere at musicmachinery.com
http://musicmachinery.com/2012/11/19/visualizing-the-structure-of-pop-music/
The road less traveled.
Canine mode reharmonization
By Paul Lamere from musicmachinery.com
http://static.echonest.com/dogstep/go.html?trid=TRMQYSY13B4C10127E
Analysis of songs and artists frequency banned from playlists
By Paul Lamere at musicmachinery.com
http://musicmachinery.com/2013/11/11/pappa-pauls-awesome-party-playlisting-app/
Echonest analysis data for any Soundcloud track.
By Paul Lamere at musicmachinery.com
Note: 6/2021 – Since the API has moved to Spotify, analysis is only allowed on Spotify tracks.
http://musicmachinery.com/2013/11/15/soundcloud-echo-nest-bridge-service/
Demo: http://labs.echonest.com/SCAnalyzer/sample.html