{"id":4159,"date":"2012-12-11T07:24:10","date_gmt":"2012-12-11T07:24:10","guid":{"rendered":"http:\/\/zerokidz.com\/ideas\/?p=4159"},"modified":"2014-07-01T22:39:39","modified_gmt":"2014-07-02T02:39:39","slug":"speech-recognition-in-max","status":"publish","type":"post","link":"https:\/\/reactivemusic.net\/?p=4159","title":{"rendered":"Speech recognition in Max"},"content":{"rendered":"<p>(update 6\/2014): it&#8217;s easier to use the Google speech-api by calling it from curl. See recent examples at:\u00a0<a href=\"https:\/\/reactivemusic.net\/?p=4690\">https:\/\/reactivemusic.net\/?p=4690<\/a><\/p>\n<p><a href=\"https:\/\/reactivemusic.net\/?p=4690\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-16976\" src=\"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png\" alt=\"\" width=\"300\" height=\"174\" srcset=\"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png 300w, https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM.png 938w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<div class=\"panel\">\n<p>original post:<\/p>\n<\/div>\n<p>from Luke Hall in the c74 forum:<\/p>\n<p><a href=\"http:\/\/cycling74.com\/forums\/topic.php?id=18403\">http:\/\/cycling74.com\/forums\/topic.php?id=18403<\/a><\/p>\n<p>I&#8217;ve used Macspeech Dictate in this way. In fact it uses the same speech recognition engine as Dragon Naturally Speaking. It works very well, but you could potentially run into the same problems as CJ described above.<\/p>\n<p>Another way to achieve this on a Mac is to use the built-in voice recognition together with AppleScript and Extra Suites, an AppleScript extension that extends the range of what you can do, including letting you send key presses.<\/p>\n<p>1. 
Turn on &#8220;Speakable Items&#8221; from System Preferences &gt; Speech &gt; Speech Recognition.<br \/>\n2. Open Max.<br \/>\n3. Open Script Editor and write a script like this:<\/p>\n<p>tell application &#8220;MaxMSP&#8221; to activate<br \/>\ntell application &#8220;Extra Suites&#8221;<br \/>\nES type key &#8220;1&#8221;<br \/>\nend tell<\/p>\n<p>4. Save it in Library &gt; Speech &gt; Speakable Items &gt; Application Speakable Items &gt; MaxMSP and name the file whatever you want the voice command to be, for example &#8220;press one&#8221;.<br \/>\n5. Now on the floating speech icon, click the down arrow at the bottom and choose &#8220;open speech commands window&#8221;. With Max as the front-most application, check that the commands you just saved as AppleScripts have appeared in the MaxMSP folder.<br \/>\n6. Now simply hook up a [key] object in Max, press &#8220;escape&#8221; (or whichever key you have set up to turn speech recognition on) and say &#8220;press one&#8221;, and you should have [key] spit out &#8220;49&#8221;!<\/p>\n<p>Sorry about the lengthy explanation. I hope it makes sense to you and gives you another possible (and cheaper!) method of obtaining your goals.<\/p>\n<p>Oh, and the AppleScript extension can be downloaded from:\u00a0<a href=\"http:\/\/www.kanzu.com\/\">http:\/\/www.kanzu.com\/<\/a><\/p>\n<p>lh<\/p>\n","protected":false},"excerpt":{"rendered":"<p>(update 6\/2014): it&#8217;s easier to use the Google speech-api by calling it from curl. See recent examples at:\u00a0https:\/\/reactivemusic.net\/?p=4690 original post: from Luke Hall in the c74 forum: http:\/\/cycling74.com\/forums\/topic.php?id=18403 I&#8217;ve used Macspeech Dictate in this way. 
In fact it uses the same speech recognition engine as Dragon Naturally Speaking, it works very well but you could &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/reactivemusic.net\/?p=4159\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Speech recognition in Max&#8221;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_coblocks_attr":"","_coblocks_dimensions":"","_coblocks_responsive_height":"","_coblocks_accordion_ie_support":"","footnotes":""},"categories":[143],"tags":[345,296,138,137],"class_list":["post-4159","post","type-post","status-publish","format-standard","hentry","category-interactive-media-art","tag-maxmsp","tag-notebook","tag-speech-to-text","tag-text-to-speech"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Speech recognition in Max - reactive music<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/reactivemusic.net\/?p=4159\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Speech recognition in Max - reactive music\" \/>\n<meta property=\"og:description\" content=\"(update 6\/2014): its easier to use the Google speech-api by calling it from curl. See recent examples at:\u00a0https:\/\/reactivemusic.net\/?p=4690 original post: from Luke Hall in the c74 forum: http:\/\/cycling74.com\/forums\/topic.php?id=18403 I&#8217;ve used Macspeech Dictate in this way. 
In fact it uses the same speech recognition engine as Dragon Naturally Speaking, it works very well but you could &hellip; Continue reading &quot;Speech recognition in Max&quot;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/reactivemusic.net\/?p=4159\" \/>\n<meta property=\"og:site_name\" content=\"reactive music\" \/>\n<meta property=\"article:published_time\" content=\"2012-12-11T07:24:10+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2014-07-02T02:39:39+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png\" \/>\n<meta name=\"author\" content=\"Tom Zicarelli\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Tom Zicarelli\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159\"},\"author\":{\"name\":\"Tom Zicarelli\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/#\\\/schema\\\/person\\\/56224d281582df7e5518e037ca63e571\"},\"headline\":\"Speech recognition in Max\",\"datePublished\":\"2012-12-11T07:24:10+00:00\",\"dateModified\":\"2014-07-02T02:39:39+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159\"},\"wordCount\":298,\"image\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/reactivemusic.net\\\/wp-content\\\/uploads\\\/2012\\\/12\\\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png\",\"keywords\":[\"Max\\\/MSP\",\"notebook\",\"speech to text\",\"text to 
speech\"],\"articleSection\":[\"interactive media art\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159\",\"url\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159\",\"name\":\"Speech recognition in Max - reactive music\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/reactivemusic.net\\\/wp-content\\\/uploads\\\/2012\\\/12\\\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png\",\"datePublished\":\"2012-12-11T07:24:10+00:00\",\"dateModified\":\"2014-07-02T02:39:39+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/#\\\/schema\\\/person\\\/56224d281582df7e5518e037ca63e571\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/reactivemusic.net\\\/?p=4159\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#primaryimage\",\"url\":\"https:\\\/\\\/reactivemusic.net\\\/wp-content\\\/uploads\\\/2012\\\/12\\\/Screen-Shot-2014-07-01-at-10.38.42-PM.png\",\"contentUrl\":\"https:\\\/\\\/reactivemusic.net\\\/wp-content\\\/uploads\\\/2012\\\/12\\\/Screen-Shot-2014-07-01-at-10.38.42-PM.png\",\"width\":938,\"height\":546},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/?p=4159#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/reactivemusic.net\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Speech recognition in Max\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/#website\",\"url\":\"https:\\\/\\\/reactivemusic.net\\\/\",\"name\":\"reactive 
music\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/reactivemusic.net\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/reactivemusic.net\\\/#\\\/schema\\\/person\\\/56224d281582df7e5518e037ca63e571\",\"name\":\"Tom Zicarelli\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/0da58cf21a2707dd335b204b8ed3cd9194dcbf9d9814ac5d71195a65c76c8a72?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/0da58cf21a2707dd335b204b8ed3cd9194dcbf9d9814ac5d71195a65c76c8a72?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/0da58cf21a2707dd335b204b8ed3cd9194dcbf9d9814ac5d71195a65c76c8a72?s=96&d=mm&r=g\",\"caption\":\"Tom Zicarelli\"},\"sameAs\":[\"http:\\\/\\\/tomzicarelli.com\"],\"url\":\"https:\\\/\\\/reactivemusic.net\\\/?author=2\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Speech recognition in Max - reactive music","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/reactivemusic.net\/?p=4159","og_locale":"en_US","og_type":"article","og_title":"Speech recognition in Max - reactive music","og_description":"(update 6\/2014): its easier to use the Google speech-api by calling it from curl. See recent examples at:\u00a0https:\/\/reactivemusic.net\/?p=4690 original post: from Luke Hall in the c74 forum: http:\/\/cycling74.com\/forums\/topic.php?id=18403 I&#8217;ve used Macspeech Dictate in this way. 
In fact it uses the same speech recognition engine as Dragon Naturally Speaking, it works very well but you could &hellip; Continue reading \"Speech recognition in Max\"","og_url":"https:\/\/reactivemusic.net\/?p=4159","og_site_name":"reactive music","article_published_time":"2012-12-11T07:24:10+00:00","article_modified_time":"2014-07-02T02:39:39+00:00","og_image":[{"url":"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png","type":"","width":"","height":""}],"author":"Tom Zicarelli","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Tom Zicarelli","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/reactivemusic.net\/?p=4159#article","isPartOf":{"@id":"https:\/\/reactivemusic.net\/?p=4159"},"author":{"name":"Tom Zicarelli","@id":"https:\/\/reactivemusic.net\/#\/schema\/person\/56224d281582df7e5518e037ca63e571"},"headline":"Speech recognition in Max","datePublished":"2012-12-11T07:24:10+00:00","dateModified":"2014-07-02T02:39:39+00:00","mainEntityOfPage":{"@id":"https:\/\/reactivemusic.net\/?p=4159"},"wordCount":298,"image":{"@id":"https:\/\/reactivemusic.net\/?p=4159#primaryimage"},"thumbnailUrl":"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png","keywords":["Max\/MSP","notebook","speech to text","text to speech"],"articleSection":["interactive media art"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/reactivemusic.net\/?p=4159","url":"https:\/\/reactivemusic.net\/?p=4159","name":"Speech recognition in Max - reactive 
music","isPartOf":{"@id":"https:\/\/reactivemusic.net\/#website"},"primaryImageOfPage":{"@id":"https:\/\/reactivemusic.net\/?p=4159#primaryimage"},"image":{"@id":"https:\/\/reactivemusic.net\/?p=4159#primaryimage"},"thumbnailUrl":"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM-300x174.png","datePublished":"2012-12-11T07:24:10+00:00","dateModified":"2014-07-02T02:39:39+00:00","author":{"@id":"https:\/\/reactivemusic.net\/#\/schema\/person\/56224d281582df7e5518e037ca63e571"},"breadcrumb":{"@id":"https:\/\/reactivemusic.net\/?p=4159#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/reactivemusic.net\/?p=4159"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/reactivemusic.net\/?p=4159#primaryimage","url":"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM.png","contentUrl":"https:\/\/reactivemusic.net\/wp-content\/uploads\/2012\/12\/Screen-Shot-2014-07-01-at-10.38.42-PM.png","width":938,"height":546},{"@type":"BreadcrumbList","@id":"https:\/\/reactivemusic.net\/?p=4159#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/reactivemusic.net\/"},{"@type":"ListItem","position":2,"name":"Speech recognition in Max"}]},{"@type":"WebSite","@id":"https:\/\/reactivemusic.net\/#website","url":"https:\/\/reactivemusic.net\/","name":"reactive music","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/reactivemusic.net\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/reactivemusic.net\/#\/schema\/person\/56224d281582df7e5518e037ca63e571","name":"Tom 
Zicarelli","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/0da58cf21a2707dd335b204b8ed3cd9194dcbf9d9814ac5d71195a65c76c8a72?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/0da58cf21a2707dd335b204b8ed3cd9194dcbf9d9814ac5d71195a65c76c8a72?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/0da58cf21a2707dd335b204b8ed3cd9194dcbf9d9814ac5d71195a65c76c8a72?s=96&d=mm&r=g","caption":"Tom Zicarelli"},"sameAs":["http:\/\/tomzicarelli.com"],"url":"https:\/\/reactivemusic.net\/?author=2"}]}},"_links":{"self":[{"href":"https:\/\/reactivemusic.net\/index.php?rest_route=\/wp\/v2\/posts\/4159","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/reactivemusic.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/reactivemusic.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/reactivemusic.net\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/reactivemusic.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4159"}],"version-history":[{"count":8,"href":"https:\/\/reactivemusic.net\/index.php?rest_route=\/wp\/v2\/posts\/4159\/revisions"}],"predecessor-version":[{"id":16977,"href":"https:\/\/reactivemusic.net\/index.php?rest_route=\/wp\/v2\/posts\/4159\/revisions\/16977"}],"wp:attachment":[{"href":"https:\/\/reactivemusic.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4159"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/reactivemusic.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4159"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/reactivemusic.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4159"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}