{"id":3963,"date":"2018-11-21T18:06:21","date_gmt":"2018-11-21T18:06:21","guid":{"rendered":"https:\/\/www.appservgrid.com\/paw92\/?p=3963"},"modified":"2018-12-07T03:41:34","modified_gmt":"2018-12-07T03:41:34","slug":"smart-speaker-voice-platforms-compared","status":"publish","type":"post","link":"https:\/\/www.appservgrid.com\/paw92\/index.php\/2018\/11\/21\/smart-speaker-voice-platforms-compared\/","title":{"rendered":"Smart speaker voice platforms compared"},"content":{"rendered":"<p><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/mycroft_mark2_temp-thm.jpg\" \/>At the Embedded Linux Conference Europe, Leon Anavi compared the Alexa and Google Assistant voice platforms and looked into open source newcomer Mycroft Mark II.<\/p>\n<p>U.S. consumers are expected to drop a bundle this Black Friday on smart speakers and home hubs. A Nov. 15 <a href=\"https:\/\/www.canalys.com\/newsroom\/amazon-reclaims-top-spot-in-smart-speaker-market-in-q3-2018\" target=\"new\">Canalys report<\/a> estimates that shipments of voice-assisted speakers grew 137 percent in Q3 2018 year-to-year and are on the way to 75 million-unit sales in 2018. At the recent Embedded Linux Conference Europe in Edinburgh, embedded Linux developer and <a href=\"http:\/\/linuxgizmos.com\/phat-adds-ir-to-the-raspberry-pi\/\" target=\"new\">Raspberry Pi HAT<\/a> creator Leon Anavi of the Konsulko Group reported on the latest smart speaker trends.<\/p>\n<p><a href=\"\/files\/elce_voice_assistant.jpg\" target=\"new\"><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/elce_voice_assistant-sm.jpg\" \/><\/a><\/p>\n<p><b>At ELCE, Leon Anavi explains inner workings of Google Assistant SDK<\/b><br \/>\n(click image to enlarge)<\/p>\n<p>&nbsp;<\/p>\n<p>As Anavi noted in his \u201cComparison of Voice Assistant SDKs for Embedded Linux Devices\u201d talk, conversing with computers became a staple of science fiction over half a century ago. Voice technology is interesting \u201cbecause it combines AI, big data, IoT, and application development,\u201d said Anavi.<\/p>\n<p>In Q3 2017, Amazon and Google owned the industry with 74.7 percent and 24.6 percent, respectively, said Canalys. A year later, the percentages were down to 31.9 and 29.8. China-based Alibaba and Xiaomi almost equally split another 21.8 percent share, followed by 17.4 percent for \u201cothers,\u201d which mostly use Amazon Alexis, and increasingly, Google Assistant.<\/p>\n<p>Despite the success of the mostly Linux-driven smart speaker market, Linux application developers have not jumped into voice app development in the numbers one might expect. In part, this is due to reservations about Google and <a href=\"https:\/\/qz.com\/1288743\/amazon-alexa-echo-spying-on-users-raises-a-data-privacy-problem\/\" target=\"new\">Amazon privacy safeguards<\/a>, as well as the proprietary nature of the hardware and cloud software.<\/p>\n<p>\u201cPrivacy is a concern with smart speakers,\u201d said Anavi. \u201cYou can\u2019t fully trust a corporation if the product is not open source.\u201d<\/p>\n<p>Anavi summarized the Google and Amazon SDKs but spent more time on the fully open source Mycroft Mark. Although Anavi clearly prefers Mycroft, he encouraged developers to investigate all the platforms. \u201cThere is a huge demand in the market for these devices and a lot of opportunity for IoT integration, from writing new skills to integrating voice assistants in consumer electronics devices,\u201d said Anavi.<\/p>\n<p><b>Alexa\/Echo<\/b><\/p>\n<p>Amazon\u2019s Alexa debuted in the Echo smart speaker four years ago. Amazon has since expanded to the Echo branded Dot, Spot, Tap, and Plus speakers, as well as the Echo Show and new <a href=\"https:\/\/www.techadvisor.co.uk\/review\/digital-home\/amazon-echo-show-2-3685964\/\" target=\"new\">Echo Show 2<\/a> display hubs.<\/p>\n<p><a href=\"\/files\/amazon_echoshow2.jpg\" target=\"new\"><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/amazon_echoshow2-sm.jpg\" \/><\/a><\/p>\n<p><b>Amazon Echo Show 2<\/b><br \/>\n(click image to enlarge)<\/p>\n<p>&nbsp;<\/p>\n<p>The market leading Echo devices run on Amazon\u2019s Linux- and Android-based Fire OS. The original Echo and Dot ran on the Cortex-A8-based TI DM3725 SoC while more recent devices have moved to an Armv8 MediaTek MT8163V SoC with 256MB RAM and 4GB flash.<\/p>\n<p>Thanks to Amazon\u2019s wise decision to release an Apache 2.0 licensed Alexa Voice Services (AVS) SDK, Alexa also runs on most third-party hubs. The SDK includes an Alexa Skills Kit for creating custom Skills. The cloud platform required to make Alexa devices work is not open source, however, and commercial vendors must sign an agreement and undergo a certification process.<\/p>\n<p>Alexa runs on a variety of hardware <a href=\"https:\/\/www.linux.com\/news\/event\/open-source-summit-na\/2017\/3\/add-skills-your-raspberry-pi-alexa\" target=\"new\">including the Raspberry Pi<\/a>, as well as smart devices ranging from the Ecobee4 Smart Thermostat to the LG Hub Robot. Microsoft recently began <a href=\"https:\/\/www.theverge.com\/2018\/11\/17\/18099978\/microsoft-store-amazon-echo-devices\" target=\"new\">selling Echo devices<\/a>, and earlier this year partnered with Amazon to integrate Alexa with its own Cortana voice agent in devices. This week, Microsoft announced that users can <a href=\"https:\/\/www.engadget.com\/2018\/11\/19\/alexa-can-now-make-skype-calls\/\" target=\"new\">voice-activate Skype calls<\/a> via Alexa on Echo devices.<\/p>\n<p>On Nov. 20, <a href=\"https:\/\/www.theverge.com\/2018\/11\/20\/18105212\/amazon-alexa-mobile-accessory-kit-ai-voice-control-bluetooth-headphones-wearables\" target=\"new\">Amazon announced<\/a> it had publicly released its Alexa Mobile Accessory Kit to help developers bring Alexa to Bluetooth headphones, headsets, and wearables. The developers kit lets Bluetooth devices communicate with a phone\u2019s Alexa app without requiring the device makers to build a custom app or Alexa skill.<\/p>\n<p><b>Google Assistant\/Home<\/b><\/p>\n<p>The Google Assistant voice agent debuted on the Google Home smart speaker in 2016. It has since expanded to the Echo Dot-like Home Mini, which like the Home runs on a 1.2GHz dual-core Cortrex-A7 Marvell Armada 1500 Mini Plus with 512MB RAM and 4GB flash. This year\u2019s <a href=\"https:\/\/store.google.com\/us\/product\/google_home_max?hl=en-US\" target=\"new\">Home Max<\/a> offered improved speakers and advanced to a 1.5GHz, quad-core Cortex-A53 processor. More recently, Google launched the touchscreen enabled <a href=\"https:\/\/arstechnica.com\/gadgets\/2018\/10\/google-home-hub-under-the-hood-its-nothing-like-other-google-smart-displays\/\" target=\"new\">Google Home Hub<\/a>.<\/p>\n<p>The Google Home devices run on a version of the Linux-based Google Cast OS. Like Alexa, the Python driven <a href=\"https:\/\/developers.google.com\/assistant\/sdk\/overview\" target=\"new\">Google Assistant SDK<\/a> lets you add the voice agent to third-party devices. However, it\u2019s still in preview stage and lacks an open source license. Developers can create applications with <a href=\"https:\/\/developers.google.com\/actions\/\" target=\"new\">Google Actions<\/a>.<\/p>\n<p>Last year, Google <a href=\"http:\/\/linuxgizmos.com\/google-assistant-sdk-dev-preview-brings-voice-agent-to-the-raspberry-pi\/\" target=\"new\">launched<\/a> a version of its Google Assistant SDK for the Raspberry Pi 3 and began selling an <a href=\"http:\/\/linuxgizmos.com\/googles-updated-aiy-vision-and-voice-kits-ship-with-raspberry-pi-zero-wh\/\" target=\"new\">AIY Voice Kit<\/a> that runs on the Pi. There\u2019s also a kit that runs on the Orange Pi, said Anavi.<\/p>\n<p>This year, Google has aggressively <a href=\"http:\/\/linuxgizmos.com\/android-things-and-google-assistant-appear-in-new-smart-speakers-smart-displays-and-coms\/\" target=\"new\">courted hardware partners<\/a> to produce home hub devices that combine Assistant with Google\u2019s proprietary <a href=\"http:\/\/linuxgizmos.com\/android-things-exits-beta-with-ota-goodies-and-licensing-restrictions\/\" target=\"new\">Android Things<\/a>. The devices run on a variety of Arm-based SoCs led by the Qualcomm SD212 Home Hub Platform.<\/p>\n<p><a href=\"\/files\/google_homehub.jpg\" target=\"new\"><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/google_homehub-sm.jpg\" \/><\/a> <a href=\"\/files\/lg_xboom_aithinq_wk9.jpg\" target=\"new\"><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/lg_xboom_aithinq_wk9-sm.jpg\" \/><\/a><\/p>\n<p><b>Google Home Hub (left) and LG XBOOM AI ThinQ WK9<\/b><br \/>\n(click images to enlarge)<\/p>\n<p>&nbsp;<\/p>\n<p>The SDK expansion has resulted in a variety of third-party devices running Assistant, including the Lenovo Smart Display and the just released<\/p>\n<p><a href=\"https:\/\/www.engadget.com\/2018\/11\/20\/lg-wk9-google-assistant-smart-speaker\/\" target=\"new\">LG XBOOM AI ThinQ WK9<\/a><\/p>\n<p>touchscreen hubs. Sales of Google Home devices outpaced Echo earlier this year, although Amazon regained the lead in Q3, says Canalys.<\/p>\n<p>Like Alexa, but unlike Mycroft, Google Assistant offers multilingual support. The latest version supports follow-up questions without having to repeat the activation word, and there\u2019s a voice match feature that can recognize up to six users. A new Google Duplex feature accomplishes real-world tasks through natural phone conversations.<\/p>\n<p><b>Mycroft\/Mark<\/b><\/p>\n<p>Anavi\u2019s favorite smart speaker is the Linux-driven, open source (Apache 2.0 and CERN) <a href=\"https:\/\/mycroft.ai\/\" target=\"new\">Mycroft<\/a>. The Raspberry Pi based <a href=\"http:\/\/linuxgizmos.com\/open-source-echo-like-gizmo-is-halfway-to-kickstarter-gold\/\" target=\"new\">Mycroft Mark 1<\/a> speaker was certified by the Open Source Hardware Association (OSHA).<\/p>\n<p>The <a href=\"http:\/\/linuxgizmos.com\/open-source-voice-assistant-promises-user-privacy\/\" target=\"new\">Mycroft Mark II<\/a> launched on Kickstarter in January and has received $450,000 in funding. This Xilinx <a href=\"http:\/\/linuxgizmos.com\/16nm-zynq-soc-mixes-cortex-a53-fpga-cortex-r5\/\" target=\"new\">Zynq UltraScale+ MPSoC<\/a> driven home hub integrates Aaware\u2019s far-field <a href=\"https:\/\/aaware.com\/technology\/\" target=\"new\">Sound Capture<\/a> technology. A <a href=\"https:\/\/www.kickstarter.com\/projects\/aiforeveryone\/mycroft-mark-ii-the-open-voice-assistant\/posts\/2344940\" target=\"new\">Nov. 15 update post<\/a> revealed that the Mark II will miss its December ship date.<\/p>\n<p><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/mycroft_mark2_temp.jpg\" \/><img decoding=\"async\" src=\"http:\/\/linuxgizmos.com\/files\/mycroft_mark2_timer.jpg\" \/><\/p>\n<p><b>Mycroft Mark II weather and timer skills<\/b><\/p>\n<p>Kansas City-based Mycroft has raised $2.5 million from institutional investors and is now seeking funding on<\/p>\n<p><a href=\"https:\/\/www.startengine.com\/mycroft-ai\" target=\"new\">StartEngine<\/a><\/p>\n<p>. Mycroft sees itself as a software company and is encouraging other companies to build the Mycroft Core platform and Mycroft AI voice agent into products. The company offers an enterprise server license to corporate customers for $1,500 a month, and there\u2019s a free, Raspbian based<\/p>\n<p><a href=\"https:\/\/mycroft.ai\/documentation\/picroft\/#hardware-prerequisites\" target=\"new\">Picroft<\/a><\/p>\n<p>application for the Raspberry Pi. A Picroft hardware kit is under consideration.<\/p>\n<p>Mycroft promises that user data will never be saved without an opt-in (to improve machine learning algorithms), and that it will never be used for marketing purposes. Like Alexa and Assistant, however, it\u2019s not available offline without a cloud service, a feature that would better ensure privacy. Anavi says the company is working on an offline option.<\/p>\n<p>The Mycroft AI agent is enabled via a Python based Mycroft Pulse SDK, and a Mycroft Skills Manager is available for Skills development. Like Alexa and Assistant, Mycroft supports custom wake words. The new version uses its homegrown <a href=\"https:\/\/mycroft.ai\/documentation\/precise\/\" target=\"new\">Precise<\/a> wake-word listener technology in place of the earlier PocketSphinx. There\u2019s also an optional device and account management stack called Mycroft Home.<\/p>\n<p>For text-to-speech (TTS), Mycroft defaults to the open source <a href=\"https:\/\/mycroft.ai\/documentation\/mimic\/\" target=\"new\">Mimic<\/a>, which is co-developed with VocaliD. It also supports eSpeak, MaryTTS, Google TTS, and FATTS.<\/p>\n<p>Mycroft lacks its own speech to-text (STT) engine, which Anavi calls \u201cthe biggest challenge for an open source voice assistant.\u201d Instead, it defaults to Google STT and supports <a href=\"http:\/\/linuxgizmos.com\/whipping-up-ibm-watson-voice-services-with-openwhisk\/\" target=\"new\">IBM Watson STT<\/a> and <a href=\"https:\/\/wit.ai\/\" target=\"new\">wit.ai<\/a>.<\/p>\n<p>Mycroft is collaborating with Mozilla on its open source<br \/>\n<a href=\"https:\/\/github.com\/mozilla\/DeepSpeech\" target=\"new\">DeepSpeech<\/a> STT, an open source TensorFlow implementation of <a href=\"http:\/\/research.baidu.com\/Blog\/index-view?id=90\" target=\"new\">Baidu\u2019s DeepSpeech<\/a> platform. Baidu trails Alibaba and Xiaomi in the <a href=\"https:\/\/www.cbinsights.com\/research\/china-voice-assistants-smart-speakers-ai\/\" target=\"new\">Chinese voice assistant<\/a> market but is one of the fastest growing voice AI companies. Just as Alibaba uses its homegrown, Alexa-like AliGenie agent on its Tmall Genie speaker, Baidu loads its <a href=\"https:\/\/www.theverge.com\/ces\/2018\/1\/8\/16866068\/baidu-smart-speakers-dueros-ces-2018\" target=\"new\">speakers<\/a> such as its ceiling-mounted PopIn Alladin with its DeepSpeech-driven <a href=\"https:\/\/dueros.baidu.com\/en\/index.html\" target=\"new\">DuerOS<\/a> voice platform. Xiaomi has used Alexa and Cortana.<\/p>\n<p>Mycroft is the most mature of several alternative voice AI projects that promise improved privacy safeguards. A recent <a href=\"https:\/\/venturebeat.com\/2018\/07\/14\/alexa-alternatives-have-a-secret-weapon-privacy\/\" target=\"new\">VentureBeat<\/a> article reported on emerging privacy-oriented technologies including <a href=\"https:\/\/snips.ai\/\" target=\"new\">Snips<\/a> and <a href=\"https:\/\/soundhound.com\/\" target=\"new\">SoundHound<\/a>.<\/p>\n<p>Anavi concluded with some demo videos showing off his soothing, Bulgarian AI whisperer vocal style. \u201cI try to be polite with these things,\u201d said Anavi. \u201cSomeday they may rule the world and I want to survive.\u201d<\/p>\n<p>Anavi\u2019s ELCE video presentation can be below.<\/p>\n<p><b>Leon Anavi\u2019s \u201cComparison of Voice Assistant SDKs for Embedded Linux Devices\u201d<\/b><\/p>\n<p><i>This article is copyright \u00a9 2018 Linux.com and was originally published <a href=\"https:\/\/www.linux.com\/blog\/2018\/11\/closer-look-voice-assisted-speakers\" target=\"new\">here<\/a>. It has been reproduced by this site with the permission of its owner. Please <a href=\"https:\/\/www.linux.com\" target=\"new\">visit Linux.com<\/a> for up-to-date news and articles about Linux and open source.<\/i><\/p>\n<p><a href=\"http:\/\/linuxgizmos.com\/smart-speaker-voice-platforms-compared\/\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>At the Embedded Linux Conference Europe, Leon Anavi compared the Alexa and Google Assistant voice platforms and looked into open source newcomer Mycroft Mark II. U.S. consumers are expected to drop a bundle this Black Friday on smart speakers and home hubs. A Nov. 15 Canalys report estimates that shipments of voice-assisted speakers grew 137 &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/www.appservgrid.com\/paw92\/index.php\/2018\/11\/21\/smart-speaker-voice-platforms-compared\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Smart speaker voice platforms compared&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-3963","post","type-post","status-publish","format-standard","hentry","category-linux"],"_links":{"self":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts\/3963","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/comments?post=3963"}],"version-history":[{"count":1,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts\/3963\/revisions"}],"predecessor-version":[{"id":4708,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/posts\/3963\/revisions\/4708"}],"wp:attachment":[{"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/media?parent=3963"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/categories?post=3963"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.appservgrid.com\/paw92\/index.php\/wp-json\/wp\/v2\/tags?post=3963"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}