.

Wednesday, March 6, 2019

Artificial Intelligence for Speech Recognition Essay

ABSTRACTWhen you dial the earphone number of a big comp any(prenominal), you argon likely to hear the sonorous function of a cultured maam who responds to your call with great courtesy saying welcome to company X. ravish give me the extension number you want .You pronounces the extension number, your name, and the name of the someone you want to contact. If the called person accepts the call, the connection is given quickly. This is artificial cognition where an reflex(a) call-handling system is employd without employing any bid operator. stilted Intelligence (AI) involves dickens basic ideas. First, it involves studying the thought processes of human beings. Second, it deals with representing those processes via machines ( electronic ready reckoners, robots, etc).AI is the behavior of a machine, which, if performed by a human being, would be called intelligent. It makes machines smarter and more habituateful, is less expensive than inborn intelligence.Natural Language Processing (NLP) refers to Artificial Intelligence methods of Communicating with a computer in a natural language like English. The main(prenominal) objective of a NLP program is to understand stimulant and protrude action. The input run-in atomic number 18 scanned and matched against internally stored known enunciates. Identification of a keyword ca usances some action to be taken. In this way, one can intercommunicate with computer in ones language. One of the main practical lotion of AI is nomenclature acknowledgment system is that it lets user do different works simultaneously. The terminology recognition process is performed by a package component known as the words recognition engine.A dialect recognition system is a type of software that allows the user to nominate their mouth words converted into compose text edition in a computer application such as a word central processing unit or spreadsheet. The computer can in any case be controlled by the use of verbalize commands. As we cant design electronic twisting which recognizes eeryones voice, ground on that it is divided into speaker dependency and speaker independency. The working of the system involves ADC, comparison of this binary version with the stored words. The limitations for this are essential be completely trained by the user, most successful for those effective in the art of dictation.It is applicable in blue eyes engine room, telephone applications like travel booking, financial account information, in military for coercive of weapons. By considering all the above factors it differs from other technologies as it produce written text from the users dictation, without using, or with only minimal use of, a traditional keyboard and mouse. This is an obvious benefit to many people who, for any number of reasons, do not find it easy to use a keyboard, or whose spelling and literacy skills would benefit from seeing occur. Speech recognition will revolutionize the wa y people conduct business over the network and will, ultimately, differentiate world-class ebusinesses the Web, decreases fatigue and make outd its own path across assorted fields.INTRODUCTIONEvidence of Artificial Intelligence folklore can be traced arse to ancient Egypt, but with the development of the electronic computer in 1941, the technology finally became available to create machine intelligence. The term artificial intelligence was first coined in 1956, at the Dartmouth conference, and since then Artificial Intelligence has expand because of the theories and principles developed by its dedicated researchers. Artificial intelligence, also known as machine intelligence, is defined as intelligence exhibited by anything manufactured (i.e. artificial) by humans or other sentient beings or systems (should such things ever exist on Earth or elsewhere). With the popularity of the AI computer growing, the arouse of the public has also grown. Applications for the Apple Macintosh a nd IBM compatible computer, such as voice and character recognition restrain become available. Also AI technology has made steadying camc redacts simple using fuzzy logic. With a greater demand for AI-related technology, new advancements are becoming available.Inevitably Artificial Intelligence has, and will continue to affecting our lives. Artificial Intelligence (AI) reason to develop computer-based systems that behave like humans learn languages accomplish personal tasks use a perceptual apparatus With the development of practical techniques based on AI research, advocates of AI have argued that opponents of AI have repeatedly changed their position on tasks such as computer chess or pitch recognition that were previously regarded as intelligent in order to deny the accomplishments of AI. They point out that this moving of the goalposts effectively defines intelligence as whatever humans can do that machines cannot.A vernacular recognition system is a type of software that allows the user to have their spoken words converted into written text in a computer application such as a word processor or spreadsheet. The computer can also be controlled by the use of spoken commands. Speech recognition software can be installed on a personal computer of appropriate specification. The user speaks into a microphone (a sound microphone is usually supplied with the product). The software generally requires an initial training and entry process in order to teach the software to recognize the voice of the user. A voice profile is then produced that is unique to that individual. This procedure also helps the user to learn how to speak to a computer.WORKINGThe user speaks to the computer through a microphone, which in turn, identifies the meaning of the words and sends it to NLP contrivance for further processing. Once recognized, the words can be used in a variety of applications like display, robotics, Commands to computers, and dictation .The word recognizer is a speech recognition system that identifies individual words.Following are a fewer of the basic terms and concepts that are fundamental to speech recognition. Utterances Pronunciations Grammar AccuracyThe speech quality varies from person to person. The grammar used by the speaker and accepted by the system, noise level, noise type, position of the microphone, and speed and manner of the users speech are some factors that whitethorn affect the quality of the speech recognition. The computer must be trained to the voice of that particular individual. such(prenominal) a system is called Speaker-dependent system. Speaker-independent system can be used by anybody, and can recognize any voice, even though the characteristics vary wide from one speaker to another. rescue DEPENDENT WORD RECOGNIERThe normal speech has a frequency range of 200 Hz to 7KHz. Recognizing a telephone call is more difficult as it has bandwidth limitations of 300Hz to 3.3KHz.As explained earlier the spoken words are processed by the filters and ADCs. The binary representation of each of these words becomes a template or standard against which the future words are compared. These templates are stored in the memory. Once the storing process is completed, the system can go into its active mode and is capable of identifying the spoken words. As each word is spoken, it is converted into binary equivalent and stored in RAM.The computer then starts inquiring and compares the binary input pattern with the templates. It is to be noted that even if the akin speaker talks the similar text, there are always lissome variations in amplitude or loudness of the signal, pitch, frequency difference, time crack etc.Due to this reason there is never a perfect match between the template and the binary input word. The pattern matching process thusly uses statistical techniques and is designed to look for the best fit. The value of binary input words are subtracted from the corresponding values in the templates . If both the values are same, the difference is zero and there is perfect match. If not the deduction produces some difference or error. the smaller the error the mitigate the match.SPEECH INDEPENDENT WORD RECOGNIZERThe search process takes a considerable fall of time, as the CPU has to make many comparisons before recognition occurs. This necessitates use of very high-speed processors. A Large RAM is also necessitate as even though a spoken word may last only a few hundred milliseconds, but the same is translated into many thousands of digital words. It is outstanding to note that alignment of words and speeds as well as elongate different parts of the same word. This is important for the speaker- independent recognizers.APPLICATIONS HEALTH CARESpeech recognition is used to modify deaf people to understand the spoken word via speech to text conversion, which is very helpful. Speech recognition is especially useful for people who have difficulty using their hands, ranging fro m mild repetitive stress injuries to involved disabilities that anticipate using conventional computer input devices.(HAND FREE COMPUTING). MILITARYspeech recognizers have been operated successfully in fighter aircraft with applications including setting radio frequencies, unequivocal an autopilot system, setting steer-point coordinates and weapons release parameters, and controlling flight displays.TRAINING direct TRAFFIC CONTROLLERSTraining for military (or civilian) air traffic controllers (ATC) represents an excellent application for speech recognition systems.TELEPHONYSpeech is used mostly as a part of User Interface, for creating pre-defined or custom speech commands.LIMITATIONS It needs to be completely tailored to the user and trained by the user. It is very much set up on one machine, and so can create difficulties for a user who Works from many locations, for example from school and home. It depends on the user having the desire to produce text and be able to gift th e Time, training and perseverance necessary to achieve it. It is most successful for those suitable in the art of dictationCONCLUSIONSpeech recognition had prevailed and achieved horrifying results in different fields .I t made our interactions with the computer easier than earlier. This technology had reduced the difference between human-to-human and human-to-machine interaction.FUTURE TRENDSIt would yield better results When it was made noise resistant. Understand our emotions User friendly as focus of a machine differs from humans It must be portable to use irrespective of the device.REFERENCESwww.seminoron .com www.edu.org www.ibm .com www.dragonsys.com

No comments:

Post a Comment