Tag: API

Notes: Chatbots in Conversation

update 6/2014 – Now part of the Internet sensors projects: http://reactivemusic.net/?p=5859

original post

They can talk with each other… sort of.

Last spring I made a project that lets you talk with chatbots using speech recognition and synthesis. http://reactivemusic.net/?p=4710.

Yesterday I managed to get two instances of this program, running on two computers, using two chatbots,  to talk with each other, through the air. Technical issues remain (see below). But there were moments of real interaction.

In the original project, a human pressed button in Max to start and stop recording speech. This has been automated. The program detects and records speech, using audio level sensing. The auto-recording sensor turns on a switch when the level hits a threshold, and turns off after a period of silence. Threshold level and duration of silence can be adjusted by the user.  There is also a feedback gate that shuts off auto-record while the computer is converting speech to text, and ‘speaking’ a reply.

technical issues

  • The Google speech API has difficulty with some of the voices used by the Mac OS speech synthesizer. We’ll need to experiment to find which voices produce accurate results.
  • The overall levels produced by the builtin Macbook speakers is not quite enough to achieve clear communication. The auto-recorder missed the onset of speech sometimes. One solution would be to insert a click to trigger the recorder, just before the speech synthesizer begins the actual speech. Or to use external speakers, or a secondary “wired” connection.
  • It would be nice to have menus of chatbots and voices. Also to automate the start of a new conversation thread.
  • The button to start the audio detector had to be operated by key-press because pushing the trackpad on a MacBook makes too much noise and always triggers the audio level detector.
  • Occasionally a chat bot would deliver a long response, or one containing a web address. These were problematic for recognition and synthesis.

local files

  • tkzic/internetsensors/speech-to-google-text-api3.maxpat
  • tkzic/internetsensors/pandorabots-api2.maxpat
  • tkzic/internetsensors/text-to-speech3.maxpat

 

Ruby tweetstream gem

Access to the Twitter streaming API

notes

This gem looks to be a possible solution for connecting Max to Google Maps. I’m guessing that the code will look very similar to the ruby code which uses the twitter gem for status updates.

https://github.com/tweetstream/tweetstream

Here’s a blog entry:

http://thechangelog.com/tweetstream-easily-access-the-twitter-streaming-api/

Client documentation:

http://rdoc.info/github/intridea/tweetstream/TweetStream/Client

Here’s the Twitter documentation on whats getting returned:

https://dev.twitter.com/docs/platform-objects/tweets

 

Getting started with Google Maps

Installed working example at:

http://zerokidz.com/gmap/map-simple.html

The Google static map API:

https://developers.google.com/maps/documentation/staticmaps/#quick_example

The tracks API:

https://developers.google.com/maps/documentation/tracks/

Simple Google Maps example from Jayway:

http://www.jayway.com/2012/04/02/simple-google-maps-api-example/

Yet another example using geolocation:

http://www.sitepoint.com/working-with-geolocation-and-google-maps-api/

Got this one working, sort of, here:

http://zerokidz.com/gmap/geolocation.html