MODEL AND APPLY THE INTELLIGENCE OF HUMAN MIND. ARTIFICIAL INTELLIGENCE (AI) IS THE SIMULATION OF HUMAN INTELLIGENCE BY MACHINES. PRINCIPLES OF AI INCLUDE:- REASONING, KNOWLEDGE, PLANNING, LEARNING AND COMMUNICATION. PERCEPTION AND THE ABILITY TO MOVE AND MANIPULATE OBJECTS.
• JOHN McCARTHY COINED THE TERM ‘ARTIFICIAL INTELLIGENCE’ 1974 • COMPUTERS BECOME FASTER AND AFFORDABLE 1980 • THE YEAR OF ARTIFICIAL INTELLIGENCE 2000 • LANDMARK OF AI ESTABLISHMENT ACHIEVED
CONTROL CAMERA SYSTEMS DIRECTED TO THE INTENDED TARGETS UNMANNED DRONE AIRCRAFT • MISSILES FLY MILITARY MISSIONS AND LAUNCH WEAPONS FROM A DISTANCE DECODING OF ENEMY SECRET CODES • RAPID TRANSLATION OF FOREIGN LANGUAGE
CLINICAL STUDIES IN ANIMALS AND HUMANS REMOTE CONTROLLED SURGICAL PROCEDURES • USES ROBOTICS AND COMPUTER BASED OPTICAL SYSTEMS ADVANCED WEATHER MODELING • DOPPLER RADAR • LOOKS AT TRENDS AND PAST MODELS TO PREDICT WEATHER PATTERNS
AND COMPUTERS ARE PLACED ON ASSEMBLY LINES TO FIND DEFECTS ROBOTICS SYSTEMS • USE COMPUTER AUTOMATION AND MECHANICAL EQUIPMENT • TAKE PLACE OF HUMANS IN MANUFACTURING CARS
SYSTEMS THAT UNDERSTANDS, THINK, LEARN AND BEHAVE LIKE HUMANS. KALPANA IS A SMART DIGITAL ASSISTANT BASED ON PYTHON LANGUAGE IT IS AN OPEN SOURCE SOFTWARE KALPANA ASSISTS WITH YOUR DAILY LIFE
KALPANA LISTENS TO THE VOICE AND MATCH WITH AVAILABLE COMMANDS IN ITS LIBRARY IF IT IS AVAILABLE THEN PROPER RESPONSE IS PROVIDED AND PROPER ACTION IS TAKEN KALPANA REPLIES WITH A SUITABLE ANSWER
INTELLIGENCE TECHNIQUES, INCLUDING NATURAL LANGUAGE PROCESSING SPEECH RECOGNITION SPEECH TO TEXT HUNT FOR THE COMMAND IN THE DATABASE TEXT TO SPEECH
PI IS A LOW COST, CREDIT-CARD SIZED COMPUTER THAT PLUGS INTO A COMPUTER MONITOR OR TV, AND USES A STANDARD KEYBOARD AND MOUSE. THE RASPBERRY PI HAS THE ABILITY TO INTERACT WITH THE OUTSIDE WORLD, AND HAS BEEN USED IN A WIDE ARRAY OF DIGITAL MAKER PROJECTS
TO THE RASPBERRY PI FOR THE VOICE INPUT • TYPE: MOBILE OR LAPTOP • POWER: 12-48 VOLTS • FREQ RANGE: 80Hz – 15KHz • CURRENT RATING: 4-22 mA • SENSITIVITY: 94 dB
RASPBERRY PI OS (FORMERLY RASPBIAN) IS A DEBIAN BASED OPERATING SYSTEM FOR RASPBERRY PI. • RASPBERRY PI OS LOOKS SIMILAR TO MANY COMMON DESKTOPS SUCH AS MACOS AND MICROSOFT WINDOWS. • PACKAGES CAN BE INSTALLED VIA APT, THE RECOMMENDED SOFTWARE APP, AND BY USING THE ADD/REMOVE SOFTWARE TOOL, A GUI WRAPPER FOR APT.
• IT’S A CONVERGENCE OF ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL LINGUISTICS WHICH HANDLES INTERACTIONS BETWEEN MACHINES AND NATURAL LANGUAGES OF HUMANS IN WHICH COMPUTERS ARE ENTAILED TO ANALYZE, UNDERSTAND, ALTER, OR GENERATE NATURAL LANGUAGE. • NLP HELPS COMPUTER MACHINES TO ENGAGE IN COMMUNICATION USING NATURAL HUMAN LANGUAGE IN MANY FORMS.
SPEECH TO TEXT SOFTWARE, OR AUTOMATIC SPEECH RECOGNITION (ASR) SOFTWARE, OR VOICE TO TEXT SOFTWARE, IS A COMPUTER PROGRAM THAT USES LINGUISTIC ALGORITHMS TO SORT AUDITORY SIGNALS AND TRANSFORM THAT INFORMATION INTO WORDS USING UNICODE CHARACTERS. • MORE SIMPLY, SPEECH TO TEXT SOFTWARE ‘LISTENS’ TO AUDIO AND DELIVERS AN EDITABLE, VERBATIM TRANSCRIPT.
TEXT WORKS? • THERE ARE MULTIPLE STEPS INVOLVED IN THE PROCESS OF CONVERTING SPEECH INTO TEXT. • WHEN YOU’RE TALKING, YOU CREATE A SERIES OF VIBRATIONS. THESE ARE TRANSLATED INTO DIGITAL LANGUAGE BY THE ANALOGUE-TO-DIGITAL CONVERTER OR THE ADC.
TEXT WORKS? • THE ADC IS ABLE TO COMPLETE THIS CONVERSION BY SAMPLING SOUNDS FROM AN AUDIO FILE AND TAKING FREQUENT, VERY DETAILED MEASUREMENTS OF THE WAVES. • THE SYSTEM HAS A FILTER TO DISTINGUISH THE SOUNDS THAT ARE RELEVANT AND DIFFERENTIATE FREQUENCIES. THE SPEED OF THE SPEECH IS ALSO MODIFIED AND THE VOLUME SET AT A CONTROL LEVEL.
TEXT WORKS? • THE NEXT STAGE INVOLVES SEGMENTING THE SIGNAL INTO HUNDREDTHS OR THOUSANDTHS OF SECONDS AND MATCHING THESE PARTS TO PHONEMES (A PHONEME IS A UNIT OF SOUND THAT DISTINGUISHES ONE WORD FROM ANOTHER IN A PARTICULAR LANGUAGE). • THERE ARE OVER 40 PHONEMES WITHIN THE ENGLISH LANGUAGE.
TEXT WORKS? • EACH PHONEME IS THEN EXAMINED AND EVALUATED IN RELATION TO OTHER PHONEMES AROUND THEM, AND THE SYSTEM THEN RUNS THE NETWORK OF PHONEMES THROUGH A COMPLICATED MATHEMATICAL MODEL TO COMPARE THEM TO WELL-KNOWN SENTENCES, INDIVIDUAL WORDS AND PHRASES. THE SYSTEM USING MACHINE LEARNING THEN CREATES TEXT BASED ON WHAT IS MOST PROBABLE THAT THE PERSON SAID. THIS IS EITHER PRESENTED AS A CHUNK OF TEXT (TEXT FILE) OR AS A FINAL COMPUTER-BASED COMMAND.
THE DATABASE • KALPANA BREAKS DOWN YOUR “COMMANDS” INTO INDIVIDUAL SOUNDS. IT THEN CONSULTS A DATABASE CONTAINING VARIOUS WORDS OR PRONUNCIATIONS TO FIND WHICH WORDS MOST CLOSELY CORRESPOND TO THE COMBINATION OF INDIVIDUAL SOUNDS. • IT THEN IDENTIFIES IMPORTANT WORDS TO MAKE SENSE OF THE TASKS AND CARRY OUT CORRESPONDING FUNCTIONS. • THE DATABASE SEND THE INFORMATION BACK TO THE DEVICE AND KALPANA MAY SPEAK.
TEXT TO SPEECH OR SPEECH SYNTHESIS IS THE ARTIFICIAL PRODUCTION OF THE HUMAN SPEECH. • A COMPUTER SYSTEM USED FOR THIS PURPOSE IS CALLED A SPEECH SYNTHESIZER, AND CAN BE IMPLEMENTED IN SOFTWARE OR HARDWARE PRODUCTS.
SPEECH WORKS? • A TEXT-TO-SPEECH (TTS) SYSTEM CONVERTS NORMAL LANGUAGE TEXT INTO SPEECH; OTHER SYSTEMS RENDER SYMBOLIC LINGUISTIC REPRESENTATIONS LIKE PHONETIC TRANSCRIPTIONS INTO SPEECH. • SYNTHESIZED SPEECH CAN BE CREATED BY CONCATENATING PIECES OF RECORDED SPEECH THAT ARE STORED IN A DATABASE.
SPEECH WORKS? • A TEXT-TO-SPEECH SYSTEM IS COMPOSED OF TWO PARTS A FRONT-END AND A BACK-END. • THE FRONT-END HAS TWO MAJOR TASKS. • FIRST, IT CONVERTS RAW TEXT CONTAINING SYMBOLS LIKE NUMBERS AND ABBREVIATIONS INTO THE EQUIVALENT OF WRITTEN-OUT WORDS.THIS PROCESS IS OFTEN CALLED TEXT NORMALIZATION, PRE-PROCESSING, OR TOKENIZATION.
SPEECH WORKS? • THE FRONT-END THEN ASSIGNS PHONETIC TRANSCRIPTIONS TO EACH WORD, AND DIVIDES AND MARKS THE TEXT INTO PROSODIC UNITS, LIKE PHRASES, CLAUSES, AND SENTENCES. THE PROCESS OF ASSIGNING PHONETIC TRANSCRIPTIONS TO WORDS IS CALLED TEXT-TO-PHONEME OR GRAPHEME-TO-PHONEME CONVERSION. PHONETIC TRANSCRIPTIONS AND PROSODY INFORMATION TOGETHER MAKE UP THE SYMBOLIC LINGUISTIC REPRESENTATION THAT IS OUTPUT BY THE FRONT-END.
SPEECH WORKS? • THE BACK-END, OFTEN REFERRED TO AS THE SYNTHESIZER. • IT CONVERTS THE SYMBOLIC LINGUISTIC REPRESENTATION INTO SOUND. IN CERTAIN SYSTEMS, THIS PART INCLUDES THE COMPUTATION OF THE TARGET PROSODY (PITCH CONTOUR, PHONEME DURATIONS),[4] WHICH IS THEN IMPOSED ON THE OUTPUT SPEECH.