About: Speech recognition

Not logged in : Login

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: Speech recognition Goto Sponge Distinct Permalink

An Entity of Type : yago:Communication100033020, within Data Space : ods-qa.openlinksw.com:8896 associated with source document(s)

Speech recognition (SR) is the inter-disciplinary sub-field of computational linguistics which incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields to develop methodologies and technologies that enables the recognition and translation of spoken language into text by computers and computerized devices such as those categorized as smart technologies and robotics. It is also known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT).

Attributes	Values
type	yago:YagoPermanentlyLocatedEntity Thing yago:WrittenCommunication106349220 yago:WrittenCommunication106349220 yago:Cognition100023271 yago:PsychologicalFeature100023100 yago:Ability105616246 yago:Act100030358 yago:Address107238694 yago:Code106355894 yago:Code106355894 yago:CodingSystem106353757 yago:CodingSystem106353757 yago:Event100029378 yago:Interface106575227 yago:Know-how105616786 yago:Method105660268 yago:Method105660268 yago:Program106568978 yago:Program106568978 yago:Software106566077 yago:Software106566077 yago:SpeechAct107160883 yago:SpeechAct107160883 yago:Technique105665146 yago:Technique105665146 yago:WikicatSpeeches yago:WikicatUserInterfaceTechniques yago:WikicatUserInterfaceTechniques yago:WikicatUserInterfaces yago:WikicatUserInterfaces yago:Writing106359877 yago:Abstraction100002137 music genre yago:Communication100033020
sameAs	http://el.dbpedia.org/resource/Αναγνώριση_ομιλίας http://eu.dbpedia.org/resource/Hizketaren_ezagutza fbase:m.07970 http://yago-knowledge.org/resource/Speech_recognition http://www.wikidata.org/entity/Q189436 http://it.dbpedia.org/resource/Riconoscimento_vocale http://cs.dbpedia.org/resource/Rozpoznávání_řeči http://de.dbpedia.org/resource/Spracherkennung Reconocimiento del habla Reconocimiento del habla http://fr.dbpedia.org/resource/Reconnaissance_automatique_de_la_parole http://id.dbpedia.org/resource/Pengenalan_ucapan http://ja.dbpedia.org/resource/音声認識 http://ko.dbpedia.org/resource/음성_인식 http://nl.dbpedia.org/resource/Spraakherkenning http://pl.dbpedia.org/resource/Rozpoznawanie_mowy http://pt.dbpedia.org/resource/Reconhecimento_de_fala http://wikidata.dbpedia.org/resource/Q189436 Speech recognition http://api.nytimes.com/svc/semantic/v2/concept/name/nytd_des/Voice%20Recognition%20Systems http://da.dbpedia.org/resource/Talegenkendelse http://mk.dbpedia.org/resource/Препознавање_на_говор http://sv.dbpedia.org/resource/Taligenkänning http://ta.dbpedia.org/resource/பேச்சுணரி http://bn.dbpedia.org/resource/কন্ঠ_সনাক্তকরণ http://eu.dbpedia.org/resource/Hizketa-ezagutze http://ar.dbpedia.org/resource/تعرف_على_الكلام http://be.dbpedia.org/resource/Распазнаванне_маўлення http://ca.dbpedia.org/resource/Reconeixement_de_la_parla http://eo.dbpedia.org/resource/Parolrekonado http://et.dbpedia.org/resource/Kõnetuvastus http://fa.dbpedia.org/resource/بازشناسی_گفتار http://fi.dbpedia.org/resource/Puheentunnistus http://gl.dbpedia.org/resource/Recoñecemento_da_fala http://he.dbpedia.org/resource/מערכת_זיהוי_דיבור http://hr.dbpedia.org/resource/Prepoznavanje_glasa http://hy.dbpedia.org/resource/Խոսքի_Ճանաչում http://is.dbpedia.org/resource/Talgreining http://ms.dbpedia.org/resource/Pengecaman_pertuturan http://no.dbpedia.org/resource/Talegjenkjenning http://ro.dbpedia.org/resource/Recunoaștere_vocală http://ru.dbpedia.org/resource/Распознавание_речи http://simple.dbpedia.org/resource/Speech_recognition http://sk.dbpedia.org/resource/Rozpoznávanie_reči http://sr.dbpedia.org/resource/Препознавање_говора http://uk.dbpedia.org/resource/Розпізнавання_мовлення http://ur.dbpedia.org/resource/کلام_شناسی http://vi.dbpedia.org/resource/Nhận_dạng_tiếng_nói http://zh.dbpedia.org/resource/语音识别 http://tr.dbpedia.org/resource/Ses_konuşma_tanımlayıcı_yazılımlar http://hi.dbpedia.org/resource/श्रुतलेखन_सॉफ्टवेयर http://th.dbpedia.org/resource/การรู้จำคำพูด https://global.dbpedia.org/id/pbBT
wasDerivedFrom	http://en.wikipedia.org/wiki/Speech_recognition?oldid=743745524 http://en.wikipedia.org/wiki/Speech_recognition?oldid=1124851343&ns=0
dbpedia-owl:abstract	Spraakherkenning is een deelgebied van de informatica en computationele taalkunde waarbinnen methoden worden onderzocht en ontwikkeld die het mogelijk maken om automaten, in het bijzonder computers, het gesproken woord te laten herkennen en verwerken. Spraakherkenning moet onderscheiden worden van stemherkenning, een biometrische techniek om een bepaalde persoon aan de hand van zijn stem te kunnen identificeren. De methoden om beide te realiseren zijn echter wel nauw verwant. Il riconoscimento vocale è il processo mediante il quale il linguaggio orale umano viene riconosciuto e successivamente elaborati attraverso un computer o più specificatamente attraverso un apposito sistema di riconoscimento vocale. Sistemi di riconoscimento vocale vengono utilizzati per applicazioni vocali automatizzare nel contesto delle applicazioni telefoniche, ad esempio call center automatici, per sistemi di dettatura (in inglese dictation system), che consentono di dettare discorsi al computer, oppure per sistemi di controllo del sistema di navigazione satellitare o del telefono in auto tramite comandi vocali. 语音识别（speech recognition；語音辨識／言語辨別）技术，也被称为自动语音识别（英语：Automatic Speech Recognition, ASR）、電腦語音識別（英语：Computer Speech Recognition）或是語音轉文本識別（英语：Speech To Text, STT），其目标是以電腦自動将人类的语音内容转换为相應的文字。与说话人识别及说话人确认不同，后者尝试识别或确认发出语音的说话人而非其中所包含的词汇内容。语音识别技术的应用包括语音拨号、语音导航、室内设备控制、语音文档检索、简单的听写数据录入等。语音识别技术与其他自然语言处理技术如机器翻译及语音合成技术相结合，可以构建出更加复杂的应用，例如语音到语音的翻译。语音识别技术所涉及的领域包括：信号处理、模式识别、概率论和信息论、发声机理和听觉机理、人工智能等等。 Tecnologias de reconhecimento da fala (também denominado em alguns aparelhos como reconhecimento de voz) permitem que computadores equipados com microfones interpretem a fala humana, por exemplo, para transcrição ou como método de comando por voz.Tais sistemas podem ser classificados por requererem, ou não, que o usuário treine o sistema a reconhecer seus padrões particulares de fala, por ter a habilidade de reconhecer fala contínua ou por requerer que o usuário fale pausadamente, e pelo tamanho do vocabulário que é capaz de reconhecer (pequeno, da ordem de dezenas a centenas de palavras, ou grande, com milhares de palavras). Sistemas que requerem pouco treinamento podem capturar continuamente a fala com um amplo vocabulário, em ritmo normal, com precisão de cerca de 98% (duas palavras erradas em cem) enquanto sistemas que não requerem treinamento podem reconhecer um número pequeno de palavras como, por exemplo, os dez dígitos do sistema decimal. Tais sistemas são populares por direcionar chamadas telefônicas recebidas, em grandes organizações, aos seus destinos. Sistemas comerciais para reconhecimento da fala têm estado disponíveis desde os anos 90, porém é interessante notar que, apesar do aparente sucesso dessa tecnologia, poucas pessoas os usam. Parece que a maioria dos usuários de computador pode criar e editar documentos mais rapidamente com um teclado convencional, apesar do fato de que muitas pessoas são capazes de falar consideravelmente mais rápido do que podem digitar. Além disso, o uso intenso dos órgãos da fala pode resultar em sobrecarga vocal. Alguns dos problemas técnicos chaves do reconhecimento da fala são: * Diferenças entre os interlocutores são freqüentemente grandes e dificultam. Não está claro quais características da fala são independentes do falante. * A interpretação de vários fonemas, palavras e frases é sensível ao contexto. Por exemplo: os fonemas são geralmente mais curtos em palavras longas do que em palavras pequenas. As palavras têm significados diferentes em frases diferentes. Por exemplo: "Philip lies" pode ser interpretado como Philip sendo um mentiroso ou como Philip deitando-se na cama. * A entonação e o timbre da fala podem mudar completamente a interpretação de uma palavra ou frase. Por exemplo: "Vai!", "Vai?" e "Vai." podem ser claramente reconhecidos por um humano, mas não tão facilmente por um computador. * Palavras e frases podem ter várias interpretações válidas de modo que o falante deixe a escolha da correta para o ouvinte. * A linguagem escrita precisa de pontuação de acordo com regras estritas que não estão fortemente presentes na fala e são difíceis de inferir sem conhecer o significado (vírgulas, fim de frase, citações). O entendimento do significado das palavras ditas é pensado como um campo separado do entendimento natural da linguagem. Há vários exemplos de frases que soam iguais e só podem ser desambiguadas pela aparição do contexto: uma famosa camisa vestida por pesquisadores da Apple Inc. dizia "I helped Apple wreck a nice beach" [Eu ajudei a Apple a destruir uma bela praia], o que, quando pronunciado, soa como "I helped Apple recognize speech" [Eu ajudei a Apple a reconhecer a fala]. Uma solução geral para muitos dos problemas acima requer efetivamente conhecimento humano, experiência e uma avançada tecnologia em inteligência artificial. Especificamente, modelos estatísticos de linguagem são freqüentemente empregados para desambiguação e melhoramento da precisão do reconhecimento. El reconocimiento automático del habla (RAH) o reconocimiento automático de voz es una disciplina de la inteligencia artificial que tiene como objetivo permitir la comunicación hablada entre seres humanos y computadoras. El problema que se plantea en un sistema de este tipo es el de hacer cooperar un conjunto de informaciones que provienen de diversas fuentes de conocimiento (acústica, fonética, fonológica, léxica, sintáctica, semántica y pragmática), en presencia de ambigüedades, incertidumbres y errores inevitables para llegar a obtener una interpretación aceptable del mensaje acústico recibido. Un sistema de reconocimiento de voz es una herramienta computacional capaz de procesar la señal de voz emitida por el ser humano y reconocer la información contenida en ésta, convirtiéndola en texto o emitiendo órdenes que actúan sobre un proceso. En su desarrollo intervienen diversas disciplinas, tales como: la fisiología, la acústica, la lingüística, el procesamiento de señales, la inteligencia artificial y la ciencia de la computación. Speech recognition (SR) is the inter-disciplinary sub-field of computational linguistics which incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields to develop methodologies and technologies that enables the recognition and translation of spoken language into text by computers and computerized devices such as those categorized as smart technologies and robotics. It is also known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT). Some SR systems use "training" (also called "enrollment") where an individual speaker reads text or isolated vocabulary into the system. The system analyzes the person's specific voice and uses it to fine-tune the recognition of that person's speech, resulting in increased accuracy. Systems that do not use training are called "speaker independent" systems. Systems that use training are called "speaker dependent". Speech recognition applications include voice user interfaces such as voice dialing (e.g. "Call home"), call routing (e.g. "I would like to make a collect call"), domotic appliance control, search (e.g. find a podcast where particular words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents (e.g. a radiology report), speech-to-text processing (e.g., word processors or emails), and aircraft (usually termed Direct Voice Input). The term voice recognition or speaker identification refers to identifying the speaker, rather than what they are saying. Recognizing the speaker can simplify the task of translating speech in systems that have been trained on a specific person's voice or it can be used to authenticate or verify the identity of a speaker as part of a security process. From the technology perspective, speech recognition has a long history with several waves of major innovations. Most recently, the field has benefited from advances in deep learning and big data. The advances are evidenced not only by the surge of academic papers published in the field, but more importantly by the world-wide industry adoption of a variety of deep learning methods in designing and deploying speech recognition systems. These speech industry players include Google, Microsoft, Hewlett Packard Enterprise, IBM, Baidu (China), Apple, Amazon, Nuance, IflyTek (China), many of which have publicized the core technology in their speech recognition systems as being based on deep learning. Die Spracherkennung oder auch automatische Spracherkennung ist ein Teilgebiet der angewandten Informatik, der Ingenieurwissenschaften und der Computerlinguistik. Sie beschäftigt sich mit der Untersuchung und Entwicklung von Verfahren, die Automaten, insbesondere Computern, die gesprochene Sprache der automatischen Datenerfassung zugänglich macht. Die Spracherkennung ist zu unterscheiden von der Stimm- bzw. Sprechererkennung, einem biometrischen Verfahren zur Personenidentifikation. Allerdings ähneln sich die Realisierungen dieser Verfahren. 25بك المحتوى هنا ينقصه الاستشهاد بمصادر. يرجى إيراد مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (فبراير 2016) التعرف على الكلام أو تمييز الكلام ( ويعرف أيضا بتمييز الكلام التلقائي أو تمييز الكلام أو حاسب تمييز الكلام) وهو عبارة عن تحويل الكلمات المنطوقة إلى نص. إن مصطلح " تمييز الصوت" في بعض الأحيان يطلق على أنظمة التمييز التي يجب أن تدرب على متحدث معين، كما هو الحال بالنسبة لمعظم برامج تمييز سطح المكتب. التعرف على المتحدث يستطيع تبسيط مهمة ترجمة الكلام.تمييز الكلام يعتبر حل أوسع يشير إلى تكنولوجيا بإمكانها التعرف على الكلام بدون أن تستهدف متحدث واحد – مثل نظام الاتصال الذي يستطيع التعرف على جميع الأصوات.تطبيقات تمييز الكلام تتضمن: واجهة المستخدم الصوتية مثل الطلب الصوتي ( على سبيل المثال: اتصل بالمنزل )؛ توجيه المكالمات ( على سبيل المثال: أريد عمل مكالمة تليفونية على حساب المتلقي )، التحكم بتطبيق أتمتة المنزل، البحث ( على سبيل المثال: أوجد البودكاست حيث الكلمات كانت منطوقة) إدخال بيانات بسيطة ( على سبيل المثال: ادخل رقم البطاقة الائتمانية )، إعداد وثائق منظمة ( مثل: تقرير الأشعة)، خطاب معالجة النصوص ( مثل: معالج الكلمات " Word " أو رسائل البريد الالكتروني ) والمركبة الجوية (مثل: أجهزة الإدخال المباشر). Rozpoznawanie mowy – technologia pozwalająca komputerowi lub innemu urządzeniu interpretować mowę ludzką, na przykład do celów transkrypcji lub jako alternatywną metodę interakcji. Dla języka polskiego (stan na rok 2008) dostępne są programy rozpoznające poprawnie 5-9 na 10 wypowiedzianych słów mowy ciągłej (na współczynnik ten, oprócz jakości algorytmu, wpływają m.in. wyrazistość i zrozumiałość mowy). Wartości skuteczności systemów rozpoznawania mowy bardzo zależą od przyjętego scenariusza testu. Dlatego informacje liczbowe, wbrew intuicji, zwykle nie są dobrym odzwierciedleniem jakości takich systemów. Najskuteczniejszą metodą jest porównanie dwóch lub więcej systemów na takim samym scenariuszu testowym. Jakość systemów może jednak także zależeć od tego jak sygnał jest rejestrowany. Przykładowo wiele z systemów oferowanych dla języka polskiego działa dużo gorzej dla sygnału z sieci GSM. Ogólnie należy przyjąć, że rozpoznawanie mowy polskiej działa poprawnie tylko dla pojedynczych słów lub dla ustalonych zbiorów scenariuszy dialogów (stan na marzec 2014). Próg komercyjnej akceptowalności systemów rozpoznawania mowy zwykle przyjmuje się jako 95% poprawności rozpoznania. 音声認識（おんせいにんしき、英: speech recognition）とは、人間の声などをコンピューターに認識させることであり、話し言葉を文字列に変換したり、あるいは音声の特徴をとらえて声を出している人を識別する機能を指す。

Faceted Search & Find service v1.17_git55 as of Mar 01 2021

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3322 as of Mar 14 2022, on Linux (x86_64-generic-linux-glibc25), Single-Server Edition (7 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software