About: Speech recognition

Not logged in : Login

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: Speech recognition Goto Sponge NotDistinct Permalink

An Entity of Type : yago:Communication100033020, within Data Space : ods-qa.openlinksw.com:8896 associated with source document(s)

Speech recognition (SR) is the inter-disciplinary sub-field of computational linguistics which incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields to develop methodologies and technologies that enables the recognition and translation of spoken language into text by computers and computerized devices such as those categorized as smart technologies and robotics. It is also known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT).

Attributes	Values
type	yago:YagoPermanentlyLocatedEntity Thing yago:WrittenCommunication106349220 yago:Cognition100023271 yago:PsychologicalFeature100023100 yago:Ability105616246 yago:Act100030358 yago:Address107238694 yago:Code106355894 yago:CodingSystem106353757 yago:Event100029378 yago:Interface106575227 yago:Know-how105616786 yago:Method105660268 yago:Program106568978 yago:Software106566077 yago:SpeechAct107160883 yago:Technique105665146 yago:WikicatSpeeches yago:WikicatUserInterfaceTechniques yago:WikicatUserInterfaces yago:Writing106359877 yago:Abstraction100002137 music genre yago:Communication100033020
sameAs	http://el.dbpedia.org/resource/Αναγνώριση_ομιλίας http://eu.dbpedia.org/resource/Hizketaren_ezagutza fbase:m.07970 http://yago-knowledge.org/resource/Speech_recognition http://www.wikidata.org/entity/Q189436 http://it.dbpedia.org/resource/Riconoscimento_vocale http://cs.dbpedia.org/resource/Rozpoznávání_řeči http://de.dbpedia.org/resource/Spracherkennung Reconocimiento del habla http://fr.dbpedia.org/resource/Reconnaissance_automatique_de_la_parole http://id.dbpedia.org/resource/Pengenalan_ucapan http://ja.dbpedia.org/resource/音声認識 http://ko.dbpedia.org/resource/음성_인식 http://nl.dbpedia.org/resource/Spraakherkenning http://pl.dbpedia.org/resource/Rozpoznawanie_mowy http://pt.dbpedia.org/resource/Reconhecimento_de_fala http://wikidata.dbpedia.org/resource/Q189436 Speech recognition http://api.nytimes.com/svc/semantic/v2/concept/name/nytd_des/Voice%20Recognition%20Systems http://da.dbpedia.org/resource/Talegenkendelse http://mk.dbpedia.org/resource/Препознавање_на_говор http://sv.dbpedia.org/resource/Taligenkänning http://ta.dbpedia.org/resource/பேச்சுணரி http://bn.dbpedia.org/resource/কন্ঠ_সনাক্তকরণ http://eu.dbpedia.org/resource/Hizketa-ezagutze http://ar.dbpedia.org/resource/تعرف_على_الكلام http://be.dbpedia.org/resource/Распазнаванне_маўлення http://ca.dbpedia.org/resource/Reconeixement_de_la_parla http://eo.dbpedia.org/resource/Parolrekonado http://et.dbpedia.org/resource/Kõnetuvastus http://fa.dbpedia.org/resource/بازشناسی_گفتار http://fi.dbpedia.org/resource/Puheentunnistus http://gl.dbpedia.org/resource/Recoñecemento_da_fala http://he.dbpedia.org/resource/מערכת_זיהוי_דיבור http://hr.dbpedia.org/resource/Prepoznavanje_glasa http://hy.dbpedia.org/resource/Խոսքի_Ճանաչում http://is.dbpedia.org/resource/Talgreining http://ms.dbpedia.org/resource/Pengecaman_pertuturan http://no.dbpedia.org/resource/Talegjenkjenning http://ro.dbpedia.org/resource/Recunoaștere_vocală http://ru.dbpedia.org/resource/Распознавание_речи http://simple.dbpedia.org/resource/Speech_recognition http://sk.dbpedia.org/resource/Rozpoznávanie_reči http://sr.dbpedia.org/resource/Препознавање_говора http://uk.dbpedia.org/resource/Розпізнавання_мовлення http://ur.dbpedia.org/resource/کلام_شناسی http://vi.dbpedia.org/resource/Nhận_dạng_tiếng_nói http://zh.dbpedia.org/resource/语音识别 http://tr.dbpedia.org/resource/Ses_konuşma_tanımlayıcı_yazılımlar http://hi.dbpedia.org/resource/श्रुतलेखन_सॉफ्टवेयर http://th.dbpedia.org/resource/การรู้จำคำพูด https://global.dbpedia.org/id/pbBT
wasDerivedFrom	http://en.wikipedia.org/wiki/Speech_recognition?oldid=743745524 http://en.wikipedia.org/wiki/Speech_recognition?oldid=1124851343&ns=0
dbpedia-owl:abstract	Spraakherkenning is een deelgebied van de informatica en computationele taalkunde waarbinnen methoden worden onderzocht en ontwikkeld die het mogelijk maken om automaten, in het bijzonder computers, het gesproken woord te laten herkennen en verwerken. Spraakherkenning moet onderscheiden worden van stemherkenning, een biometrische techniek om een bepaalde persoon aan de hand van zijn stem te kunnen identificeren. De methoden om beide te realiseren zijn echter wel nauw verwant. Il riconoscimento vocale è il processo mediante il quale il linguaggio orale umano viene riconosciuto e successivamente elaborati attraverso un computer o più specificatamente attraverso un apposito sistema di riconoscimento vocale. Sistemi di riconoscimento vocale vengono utilizzati per applicazioni vocali automatizzare nel contesto delle applicazioni telefoniche, ad esempio call center automatici, per sistemi di dettatura (in inglese dictation system), che consentono di dettare discorsi al computer, oppure per sistemi di controllo del sistema di navigazione satellitare o del telefono in auto tramite comandi vocali. 语音识别（speech recognition；語音辨識／言語辨別）技术，也被称为自动语音识别（英语：Automatic Speech Recognition, ASR）、電腦語音識別（英语：Computer Speech Recognition）或是語音轉文本識別（英语：Speech To Text, STT），其目标是以電腦自動将人类的语音内容转换为相應的文字。与说话人识别及说话人确认不同，后者尝试识别或确认发出语音的说话人而非其中所包含的词汇内容。语音识别技术的应用包括语音拨号、语音导航、室内设备控制、语音文档检索、简单的听写数据录入等。语音识别技术与其他自然语言处理技术如机器翻译及语音合成技术相结合，可以构建出更加复杂的应用，例如语音到语音的翻译。语音识别技术所涉及的领域包括：信号处理、模式识别、概率论和信息论、发声机理和听觉机理、人工智能等等。 Tecnologias de reconhecimento da fala (também denominado em alguns aparelhos como reconhecimento de voz) permitem que computadores equipados com microfones interpretem a fala humana, por exemplo, para transcrição ou como método de comando por voz.Tais sistemas podem ser classificados por requererem, ou não, que o usuário treine o sistema a reconhecer seus padrões particulares de fala, por ter a habilidade de reconhecer fala contínua ou por requerer que o usuário fale pausadamente, e pelo tamanho do vocabulário que é capaz de reconhecer (pequeno, da ordem de dezenas a centenas de palavras, ou grande, com milhares de palavras). Sistemas que requerem pouco treinamento podem capturar continuamente a fala com um amplo vocabulário, em ritmo normal, com precisão de cerca de 98% (duas palavras erradas em cem) enquanto sistemas que não requerem treinamento podem reconhecer um número pequeno de palavras como, por exemplo, os dez dígitos do sistema decimal. Tais sistemas são populares por direcionar chamadas telefônicas recebidas, em grandes organizações, aos seus destinos. Sistemas comerciais para reconhecimento da fala têm estado disponíveis desde os anos 90, porém é interessante notar que, apesar do aparente sucesso dessa tecnologia, poucas pessoas os usam. Parece que a maioria dos usuários de computador pode criar e editar documentos mais rapidamente com um teclado convencional, apesar do fato de que muitas pessoas são capazes de falar consideravelmente mais rápido do que podem digitar. Além disso, o uso intenso dos órgãos da fala pode resultar em sobrecarga vocal. Alguns dos problemas técnicos chaves do reconhecimento da fala são: * Diferenças entre os interlocutores são freqüentemente grandes e dificultam. Não está claro quais características da fala são independentes do falante. * A interpretação de vários fonemas, palavras e frases é sensível ao contexto. Por exemplo: os fonemas são geralmente mais curtos em palavras longas do que em palavras pequenas. As palavras têm significados diferentes em frases diferentes. Por exemplo: "Philip lies" pode ser interpretado como Philip sendo um mentiroso ou como Philip deitando-se na cama. * A entonação e o timbre da fala podem mudar completamente a interpretação de uma palavra ou frase. Por exemplo: "Vai!", "Vai?" e "Vai." podem ser claramente reconhecidos por um humano, mas não tão facilmente por um computador. * Palavras e frases podem ter várias interpretações válidas de modo que o falante deixe a escolha da correta para o ouvinte. * A linguagem escrita precisa de pontuação de acordo com regras estritas que não estão fortemente presentes na fala e são difíceis de inferir sem conhecer o significado (vírgulas, fim de frase, citações). O entendimento do significado das palavras ditas é pensado como um campo separado do entendimento natural da linguagem. Há vários exemplos de frases que soam iguais e só podem ser desambiguadas pela aparição do contexto: uma famosa camisa vestida por pesquisadores da Apple Inc. dizia "I helped Apple wreck a nice beach" [Eu ajudei a Apple a destruir uma bela praia], o que, quando pronunciado, soa como "I helped Apple recognize speech" [Eu ajudei a Apple a reconhecer a fala]. Uma solução geral para muitos dos problemas acima requer efetivamente conhecimento humano, experiência e uma avançada tecnologia em inteligência artificial. Especificamente, modelos estatísticos de linguagem são freqüentemente empregados para desambiguação e melhoramento da precisão do reconhecimento. El reconocimiento automático del habla (RAH) o reconocimiento automático de voz es una disciplina de la inteligencia artificial que tiene como objetivo permitir la comunicación hablada entre seres humanos y computadoras. El problema que se plantea en un sistema de este tipo es el de hacer cooperar un conjunto de informaciones que provienen de diversas fuentes de conocimiento (acústica, fonética, fonológica, léxica, sintáctica, semántica y pragmática), en presencia de ambigüedades, incertidumbres y errores inevitables para llegar a obtener una interpretación aceptable del mensaje acústico recibido. Un sistema de reconocimiento de voz es una herramienta computacional capaz de procesar la señal de voz emitida por el ser humano y reconocer la información contenida en ésta, convirtiéndola en texto o emitiendo órdenes que actúan sobre un proceso. En su desarrollo intervienen diversas disciplinas, tales como: la fisiología, la acústica, la lingüística, el procesamiento de señales, la inteligencia artificial y la ciencia de la computación. Speech recognition (SR) is the inter-disciplinary sub-field of computational linguistics which incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields to develop methodologies and technologies that enables the recognition and translation of spoken language into text by computers and computerized devices such as those categorized as smart technologies and robotics. It is also known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT). Some SR systems use "training" (also called "enrollment") where an individual speaker reads text or isolated vocabulary into the system. The system analyzes the person's specific voice and uses it to fine-tune the recognition of that person's speech, resulting in increased accuracy. Systems that do not use training are called "speaker independent" systems. Systems that use training are called "speaker dependent". Speech recognition applications include voice user interfaces such as voice dialing (e.g. "Call home"), call routing (e.g. "I would like to make a collect call"), domotic appliance control, search (e.g. find a podcast where particular words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents (e.g. a radiology report), speech-to-text processing (e.g., word processors or emails), and aircraft (usually termed Direct Voice Input). The term voice recognition or speaker identification refers to identifying the speaker, rather than what they are saying. Recognizing the speaker can simplify the task of translating speech in systems that have been trained on a specific person's voice or it can be used to authenticate or verify the identity of a speaker as part of a security process. From the technology perspective, speech recognition has a long history with several waves of major innovations. Most recently, the field has benefited from advances in deep learning and big data. The advances are evidenced not only by the surge of academic papers published in the field, but more importantly by the world-wide industry adoption of a variety of deep learning methods in designing and deploying speech recognition systems. These speech industry players include Google, Microsoft, Hewlett Packard Enterprise, IBM, Baidu (China), Apple, Amazon, Nuance, IflyTek (China), many of which have publicized the core technology in their speech recognition systems as being based on deep learning. Die Spracherkennung oder auch automatische Spracherkennung ist ein Teilgebiet der angewandten Informatik, der Ingenieurwissenschaften und der Computerlinguistik. Sie beschäftigt sich mit der Untersuchung und Entwicklung von Verfahren, die Automaten, insbesondere Computern, die gesprochene Sprache der automatischen Datenerfassung zugänglich macht. Die Spracherkennung ist zu unterscheiden von der Stimm- bzw. Sprechererkennung, einem biometrischen Verfahren zur Personenidentifikation. Allerdings ähneln sich die Realisierungen dieser Verfahren. 25بك المحتوى هنا ينقصه الاستشهاد بمصادر. يرجى إيراد مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (فبراير 2016) التعرف على الكلام أو تمييز الكلام ( ويعرف أيضا بتمييز الكلام التلقائي أو تمييز الكلام أو حاسب تمييز الكلام) وهو عبارة عن تحويل الكلمات المنطوقة إلى نص. إن مصطلح " تمييز الصوت" في بعض الأحيان يطلق على أنظمة التمييز التي يجب أن تدرب على متحدث معين، كما هو الحال بالنسبة لمعظم برامج تمييز سطح المكتب. التعرف على المتحدث يستطيع تبسيط مهمة ترجمة الكلام.تمييز الكلام يعتبر حل أوسع يشير إلى تكنولوجيا بإمكانها التعرف على الكلام بدون أن تستهدف متحدث واحد – مثل نظام الاتصال الذي يستطيع التعرف على جميع الأصوات.تطبيقات تمييز الكلام تتضمن: واجهة المستخدم الصوتية مثل الطلب الصوتي ( على سبيل المثال: اتصل بالمنزل )؛ توجيه المكالمات ( على سبيل المثال: أريد عمل مكالمة تليفونية على حساب المتلقي )، التحكم بتطبيق أتمتة المنزل، البحث ( على سبيل المثال: أوجد البودكاست حيث الكلمات كانت منطوقة) إدخال بيانات بسيطة ( على سبيل المثال: ادخل رقم البطاقة الائتمانية )، إعداد وثائق منظمة ( مثل: تقرير الأشعة)، خطاب معالجة النصوص ( مثل: معالج الكلمات " Word " أو رسائل البريد الالكتروني ) والمركبة الجوية (مثل: أجهزة الإدخال المباشر). Rozpoznawanie mowy – technologia pozwalająca komputerowi lub innemu urządzeniu interpretować mowę ludzką, na przykład do celów transkrypcji lub jako alternatywną metodę interakcji. Dla języka polskiego (stan na rok 2008) dostępne są programy rozpoznające poprawnie 5-9 na 10 wypowiedzianych słów mowy ciągłej (na współczynnik ten, oprócz jakości algorytmu, wpływają m.in. wyrazistość i zrozumiałość mowy). Wartości skuteczności systemów rozpoznawania mowy bardzo zależą od przyjętego scenariusza testu. Dlatego informacje liczbowe, wbrew intuicji, zwykle nie są dobrym odzwierciedleniem jakości takich systemów. Najskuteczniejszą metodą jest porównanie dwóch lub więcej systemów na takim samym scenariuszu testowym. Jakość systemów może jednak także zależeć od tego jak sygnał jest rejestrowany. Przykładowo wiele z systemów oferowanych dla języka polskiego działa dużo gorzej dla sygnału z sieci GSM. Ogólnie należy przyjąć, że rozpoznawanie mowy polskiej działa poprawnie tylko dla pojedynczych słów lub dla ustalonych zbiorów scenariuszy dialogów (stan na marzec 2014). Próg komercyjnej akceptowalności systemów rozpoznawania mowy zwykle przyjmuje się jako 95% poprawności rozpoznania. 音声認識（おんせいにんしき、英: speech recognition）とは、人間の声などをコンピューターに認識させることであり、話し言葉を文字列に変換したり、あるいは音声の特徴をとらえて声を出している人を識別する機能を指す。 Распознавание речи — процесс преобразования речевого сигнала в цифровую информацию (например, текстовые данные). Обратной задачей является синтез речи. La reconnaissance automatique de la parole (souvent improprement appelée reconnaissance vocale) est une technique informatique qui permet d'analyser la voix humaine captée au moyen d'un microphone pour la transcrire sous la forme d'un texte exploitable par une machine. La reconnaissance de la parole, ainsi que la synthèse de la parole, l'identification du locuteur ou la vérification du locuteur, font partie des techniques de traitement de la parole. Ces techniques permettent notamment de réaliser des interfaces homme-machine (IHM) où une partie de l'interaction se fait à la voix : « interfaces vocales », Parmi les nombreuses applications, on peut citer les applications de dictée vocale sur ordinateur où la difficulté tient à la taille du vocabulaire et à la longueur des phrases, mais aussi les applications téléphoniques de type serveur vocal interactif, où la difficulté tient plutôt à la nécessité de reconnaître n'importe quelle voix dans des conditions acoustiques variables et souvent bruyantes (téléphones mobiles dans des lieux publics). Dans Parole et dialogue homme-machine, W. Minker et S. Bennacef expliquent que la reconnaissance automatique de la parole est un domaine complexe, car il existe une différence importante entre le langage formel, qui est compris et utilisé par les machines, et le langage naturel, que les humains utilisent. Le langage formel est structuré par des règles syntaxiques strictes et sans ambigüité. À l'inverse, dans le langage naturel, des mots ou des phrases peuvent avoir plusieurs sens selon l'intonation de l'énonciateur ou le contexte par exemple. Il riconoscimento vocale è il processo mediante il quale il linguaggio orale umano viene riconosciuto e successivamente elaborato attraverso un computer o più specificatamente attraverso un apposito sistema di riconoscimento vocale. Sistemi di riconoscimento vocale vengono utilizzati per applicazioni vocali automatizzate nel contesto delle applicazioni telefoniche, ad esempio call center automatici, per sistemi di dettatura (in inglese dictation system), che consentono di dettare discorsi al computer, oppure per sistemi di controllo del sistema di navigazione satellitare o del telefono in auto tramite comandi vocali. Pengenalan ucapan atau pengenalan wicara—dalam istilah bahasa Inggrisnya, automatic speech recognition (ASR)—adalah suatu pengembangan teknik dan sistem yang memungkinkan komputer untuk menerima masukan berupa kata yang diucapkan. Teknologi ini memungkinkan suatu perangkat untuk mengenali dan memahami kata-kata yang diucapkan dengan cara digitalisasi kata dan mencocokkan sinyal digital tersebut dengan suatu pola tertentu yang tersimpan dalam suatu perangkat. Kata-kata yang diucapkan diubah bentuknya menjadi sinyal digital dengan cara mengubah gelombang suara menjadi sekumpulan angka yang kemudian disesuaikan dengan kode-kode tertentu untuk mengidentifikasikan kata-kata tersebut. Hasil dari identifikasi kata yang diucapkan dapat ditampilkan dalam bentuk tulisan atau dapat dibaca oleh perangkat teknologi sebagai sebuah komando untuk melakukan suatu pekerjaan, misalnya penekanan tombol pada telepon genggam yang dilakukan secara otomatis dengan komando suara. Alat pengenal ucapan, yang sering disebut dengan speech recognizer, membutuhkan sampel kata sebenarnya yang diucapkan dari pengguna. Sampel kata akan didigitalisasi, disimpan dalam komputer, dan kemudian digunakan sebagai basis data dalam mencocokkan kata yang diucapkan selanjutnya. Sebagian besar sifatnya masih tergantung kepada pembicara. Alat ini hanya dapat mengenal kata yang diucapkan dari satu atau dua orang saja dan hanya bisa mengenal kata-kata terpisah, yaitu kata-kata yang dalam penyampaiannya terdapat jeda antar kata. Hanya sebagian kecil dari peralatan yang menggunakan teknologi ini yang sifatnya tidak tergantung pada pembicara. Alat ini sudah dapat mengenal kata yang diucapkan oleh banyak orang dan juga dapat mengenal kata-kata kontinu, atau kata-kata yang dalam penyampaiannya tidak terdapat jeda antar kata. Pengenalan ucapan dalam perkembangan teknologinya merupakan bagian dari pengenalan suara (proses identifikasi seseorang berdasarkan suaranya). Pengenalan suara sendiri terbagi menjadi dua, yaitu pengenalan pembicara (identifikasi suara berdasarkan orang yang berbicara) dan pengenalan ucapan (identifikasi suara berdasarkan kata yang diucapkan). El reconocimiento automático del habla (RAH) o reconocimiento automático de voz es una disciplina de la inteligencia artificial que tiene como objetivo permitir la comunicación hablada entre seres humanos y computadoras. El problema que se plantea en un sistema de este tipo es el de hacer cooperar un conjunto de informaciones que provienen de diversas fuentes de conocimiento (acústica, fonética, fonológica, léxica, sintáctica, semántica y pragmática), en presencia de ambigüedades, incertidumbres y errores inevitables para llegar a obtener una interpretación aceptable del mensaje acústico recibido. Un sistema de reconocimiento de voz es una herramienta computacional capaz de procesar la señal de voz emitida por el ser humano y reconocer la información contenida en esta, convirtiéndola en texto o emitiendo órdenes que actúan sobre un proceso. En su desarrollo intervienen diversas disciplinas, tales como: la fisiología, la acústica, la lingüística, el procesamiento de señales, la inteligencia artificial y la ciencia de la computación. 음성 인식(Speech Recognition)이란 사람이 말하는 음성 언어를 컴퓨터가 해석해 그 내용을 문자 데이터로 전환하는 처리를 말한다. STT(Speech-to-Text)라고도 한다. 키보드 대신 문자를 입력하는 방식으로 주목을 받고 있다. 로봇, 텔레매틱스 등 음성으로 기기제어, 정보검색이 필요한 경우에 응용된다. 대표적인 알고리즘은 HMM(Hidden Markov Model)으로서, 다양한 화자들이 발성한 음성들을 통계적으로 모델링하여 음향모델을 구성하며 말뭉치 수집을 통하여 언어모델을 구성한다. 미리 기록해 둔 음성 패턴과 비교해 개인 인증 등의 용도로 사용하기도 하는데 이를 화자 인식이라고 한다. Die Spracherkennung oder auch automatische Spracherkennung ist ein Verfahren und ein Teilgebiet der angewandten Informatik, der Ingenieurwissenschaften und der Computerlinguistik. Sie beschäftigt sich mit der Untersuchung und Entwicklung von Verfahren, die Automaten, insbesondere Computern, die gesprochene Sprache der automatischen Datenerfassung zugänglich macht. So lassen sich beispielsweise aus Tonspuren durchsuchbare Transkripte erstellen. Die Spracherkennung ist zu unterscheiden von der Stimm- bzw. Sprechererkennung, einem biometrischen Verfahren zur Personenidentifikation. Allerdings ähneln sich die Realisierungen dieser Verfahren. Розпізнава́ння мо́влення (англ. speech recognition) або мо́влення-у-те́кст (англ. speech to text (STT))— процес перетворення мовленнєвого сигналу в текстовий потік. Не варто плутати із визначенням розпізнавання мови, оскільки «розпізнати мову» безпосередньо означає лише дати відповідь на питання, до якої мови належить сегмент мовленнєвого сигналу. Часто використовується у наборі технологій, що дають змогу керувати комп'ютером, використовуючи людський голос, вводити інформацію голосом, диктувати, транскрибувати (стенографувати) фонограми. La reconnaissance automatique de la parole (souvent improprement appelée reconnaissance vocale) est une technique informatique qui permet d'analyser la voix humaine captée au moyen d'un microphone pour la transcrire sous la forme d'un texte exploitable par une machine. La reconnaissance de la parole, ainsi que la synthèse de la parole, l'identification du locuteur ou la vérification du locuteur, font partie des techniques de traitement de la parole. Ces techniques permettent notamment de réaliser des interfaces homme-machine (IHM) où une partie de l'interaction se fait à la voix : « interfaces vocales ». Parmi les nombreuses applications, on peut citer les applications de dictée vocale sur ordinateur où la difficulté tient à la taille du vocabulaire et à la longueur des phrases, mais aussi les applications téléphoniques de type serveur vocal interactif, où la difficulté tient plutôt à la nécessité de reconnaître n'importe quelle voix dans des conditions acoustiques variables et souvent bruyantes (téléphones mobiles dans des lieux publics). Dans Parole et dialogue homme-machine, W. Minker et S. Bennacef expliquent que la reconnaissance automatique de la parole est un domaine complexe, car il existe une différence importante entre le langage formel, qui est compris et utilisé par les machines, et le langage naturel, que les humains utilisent. Le langage formel est structuré par des règles syntaxiques strictes et sans ambigüité. À l'inverse, dans le langage naturel, des mots ou des phrases peuvent avoir plusieurs sens selon l'intonation de l'énonciateur ou le contexte par exemple. 音声認識（おんせいにんしき、英: speech recognition）とは、人間の声などをコンピューターに認識させることであり、話し言葉を文字列に変換したり、あるいは音声の特徴をとらえて声を出している人を識別する機能を指す。自動音声認識（英: Automatic Speech Recognition; ASR）とも。 El reconeixement automàtic de la parla (RAP) o reconeixement automàtic de veu és una part de la intel·ligència artificial que té com a objectiu permetre la comunicació parlada entre éssers humans i computadores electròniques. Un sistema de reconeixement de veu és una eina computacional, capaç de processar el senyal de veu i reconèixer la informació que porta. Les disciplines que intervenen en aquest procés són, la fisiologia, l'acústica, el processament de senyal (quantificació), la intel·ligència artificial i la ciència computacional. El principal problema que es planteja en un sistema de RAP és el de fer cooperar un conjunt d'informacions que provenen de diverses fonts de coneixement: acústica, fonètica, fonològica, lèxica, sintàctica, semàntica i pragmàtica); en presència d'ambigüitats, incerteses i errors inevitables per arribar a obtenir una interpretació acceptable del missatge acústic rebut. Es tracta d'una tecnologia que ha experimentat un major avanç en els últims anys, passant de poder reconèixer només a un parlant, dins un vocabulari limitat, fins a prototips que poden reconèixer qualsevol parlant sobre vocabularis flexibles de milers de paraules. El procés de RAP intenta aconseguir una seqüència de paraules que corresponguin a la frase en el llenguatge natural d'entrada. La frase és pronunciada de forma contínua, sense pauses entre les paraules. D'aquesta manera no es generen problemes gramaticals. Per aquest motiu, aquests sistemes són força costosos en concepte de memòria de càlcul.

Faceted Search & Find service v1.17_git55 as of Mar 01 2021

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3322 as of Mar 14 2022, on Linux (x86_64-generic-linux-glibc25), Single-Server Edition (7 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software