Project.pdf

Jojo Joseph

May 31, 2021
Transcript

  1. VOICE AIDED SELF LEARNING MACHINE (KALPANA)
    PROJECT GUIDE: SYNDICATE MEMBERS:
    SHRAVAN RAM, MCMECH (AR) II
    ISHTIYAQUE ANSARI, LEMA (R)
    JITHINRAJ JS, LEMA (R)
    AK BARIK, LEMA (R)
  2. CONTENTS
    1. INTRODUCTION
    2. HISTORY
    3. APPLICATIONS
    4. PROJECT KALPANA
    5. AVAILABLE APPLICATIONS
    6. CONCLUSION
    7. BIBLIOGRAPHY
  3. INTRODUCTION
    • ARTIFICIAL INTELLIGENCE (AI) IS THE STUDY OF COMPUTER SYSTEMS THAT ATTEMPT TO MODEL AND APPLY THE INTELLIGENCE OF THE HUMAN MIND.
    • AI IS THE SIMULATION OF HUMAN INTELLIGENCE BY MACHINES.
    • PRINCIPLES OF AI INCLUDE REASONING, KNOWLEDGE, PLANNING, LEARNING, COMMUNICATION, PERCEPTION, AND THE ABILITY TO MOVE AND MANIPULATE OBJECTS.
  4. HISTORY
    • 1950: THE TIME WHEN IT ALL STARTED
    • 1955: JOHN McCARTHY COINED THE TERM ‘ARTIFICIAL INTELLIGENCE’
    • 1974: COMPUTERS BECAME FASTER AND MORE AFFORDABLE
    • 1980: THE YEAR OF ARTIFICIAL INTELLIGENCE
    • 2000: LANDMARK OF AI ESTABLISHMENT ACHIEVED
  5. APPLICATIONS: MILITARY
    SMART BOMBS
      • GUIDED BY REMOTE-CONTROL CAMERA SYSTEMS DIRECTED TO THE INTENDED TARGETS
    UNMANNED DRONE AIRCRAFT
      • MISSILES FLY MILITARY MISSIONS AND LAUNCH WEAPONS FROM A DISTANCE
    DECODING OF ENEMY SECRET CODES
      • RAPID TRANSLATION OF FOREIGN LANGUAGES
  6. APPLICATIONS: SCIENCE
    DEVELOPMENT OF NEW DRUGS
      • MODEL CLINICAL STUDIES IN ANIMALS AND HUMANS
    REMOTE-CONTROLLED SURGICAL PROCEDURES
      • USE ROBOTICS AND COMPUTER-BASED OPTICAL SYSTEMS
    ADVANCED WEATHER MODELING
      • DOPPLER RADAR
      • LOOKS AT TRENDS AND PAST MODELS TO PREDICT WEATHER PATTERNS
  7. APPLICATIONS: INDUSTRIAL
    MACHINE VISION INSPECTION SYSTEMS
      • CAMERAS AND COMPUTERS ARE PLACED ON ASSEMBLY LINES TO FIND DEFECTS
    ROBOTICS SYSTEMS
      • USE COMPUTER AUTOMATION AND MECHANICAL EQUIPMENT
      • TAKE THE PLACE OF HUMANS IN MANUFACTURING CARS
  8. PROJECT KALPANA: OVERVIEW
    • HUMANS HAVE DEVELOPED ARTIFICIAL INTELLIGENCE SYSTEMS THAT UNDERSTAND, THINK, LEARN AND BEHAVE LIKE HUMANS.
    • KALPANA IS A SMART DIGITAL ASSISTANT
    • IT IS BASED ON THE PYTHON LANGUAGE
    • IT IS OPEN SOURCE SOFTWARE
    • KALPANA ASSISTS WITH YOUR DAILY LIFE
  9. PROJECT KALPANA: HOW IT WORKS
    • SPEAK “HEY KALPANA”
    • KALPANA LISTENS TO THE VOICE AND MATCHES IT AGAINST THE AVAILABLE COMMANDS IN ITS LIBRARY
    • IF THE COMMAND IS AVAILABLE, THE PROPER RESPONSE IS PROVIDED AND THE PROPER ACTION IS TAKEN
    • KALPANA REPLIES WITH A SUITABLE ANSWER
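
The flow on this slide can be sketched in a few lines of Python. This is only an illustration, not Kalpana's actual code: it assumes the speech has already been turned into text, and the names `WAKE_PHRASE`, `COMMANDS` and `handle_utterance` are hypothetical.

```python
WAKE_PHRASE = "hey kalpana"

# Hypothetical command library: phrase -> (spoken reply, optional action to run).
COMMANDS = {
    "what is your name": ("My name is Kalpana.", None),
    "say hello": ("Hello there!", None),
}


def handle_utterance(text, commands=COMMANDS):
    """Return the reply for a transcribed utterance, or None if Kalpana was not addressed."""
    # Lowercase, drop commas and collapse repeated spaces before comparing.
    words = " ".join(text.lower().replace(",", " ").split())
    if not words.startswith(WAKE_PHRASE):
        return None  # wake phrase missing: stay silent
    command = words[len(WAKE_PHRASE):].strip()
    entry = commands.get(command)
    if entry is None:
        return "Sorry, I do not know that command."
    reply, action = entry
    if action is not None:
        action()  # carry out the matching action before replying
    return reply
```

For example, `handle_utterance("Hey Kalpana, what is your name")` returns the stored reply, while the same question without the wake phrase returns `None`.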
  10. PROJECT KALPANA: BUILDING KALPANA
    IT USES SEVERAL ARTIFICIAL INTELLIGENCE TECHNIQUES, INCLUDING:
      • NATURAL LANGUAGE PROCESSING
      • SPEECH RECOGNITION
      • SPEECH TO TEXT
      • SEARCHING FOR THE COMMAND IN THE DATABASE
      • TEXT TO SPEECH
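
One way to picture how these techniques fit together is as a chain of stages. The sketch below is an assumption about the overall shape, not the project's implementation: `run_pipeline` and the three `fake_*` stand-in stages are hypothetical names, and a real build would plug actual speech engines into the same slots.

```python
def run_pipeline(audio, speech_to_text, find_command, text_to_speech):
    """Chain the stages: audio -> text -> command lookup -> spoken reply."""
    text = speech_to_text(audio)
    reply = find_command(text)
    return text_to_speech(reply)


# Stand-in stages for illustration only.
def fake_speech_to_text(audio):
    # Pretend the audio bytes already hold the spoken words.
    return audio.decode("utf-8")


def fake_find_command(text):
    database = {"what time is it": "I do not have a clock yet."}
    return database.get(text.lower().strip(), "Command not found.")


def fake_text_to_speech(reply):
    # Wrap the reply so it is clear where synthesized audio would be produced.
    return ("<speech>" + reply + "</speech>").encode("utf-8")
```

Passing the stages in as arguments keeps each technique replaceable without touching the rest of the chain.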
  11. PROJECT KALPANA: BUILDING KALPANA: HARDWARE REQUIREMENTS
    • RASPBERRY PI 3B+
    • MICROPHONE
    • SPEAKER
    • SOUND CARD
    • POWER ADAPTER
  12. PROJECT KALPANA: BUILDING KALPANA: RASPBERRY PI
    THE RASPBERRY PI IS A LOW-COST, CREDIT-CARD-SIZED COMPUTER THAT PLUGS INTO A COMPUTER MONITOR OR TV, AND USES A STANDARD KEYBOARD AND MOUSE. THE RASPBERRY PI HAS THE ABILITY TO INTERACT WITH THE OUTSIDE WORLD, AND HAS BEEN USED IN A WIDE ARRAY OF DIGITAL MAKER PROJECTS.
  13. PROJECT KALPANA: BUILDING KALPANA: MICROPHONE
    A MICROPHONE IS CONNECTED TO THE RASPBERRY PI FOR THE VOICE INPUT
      • TYPE: MOBILE OR LAPTOP
      • POWER: 12-48 VOLTS
      • FREQ RANGE: 80 Hz – 15 kHz
      • CURRENT RATING: 4-22 mA
      • SENSITIVITY: 94 dB
  14. PROJECT KALPANA: BUILDING KALPANA: SPEAKER
    SPEAKERS ARE CONNECTED TO THE RASPBERRY PI FOR THE VOICE OUTPUT
      • TYPE: MOBILE OR LAPTOP
      • POWER O/P: 10 W
      • FREQ RANGE: 20 Hz – 20 kHz
      • IMPEDANCE: 4 OHMS
      • SENSITIVITY: 95 dB
  15. PROJECT KALPANA: BUILDING KALPANA: SOUND CARD
    ADAPTER WITH 3.5 mm HEADPHONE/MIC AUDIO JACK TO 3.5 mm Y SPLITTER CABLE WITH SEPARATE HEADSET/MICROPHONE (1 MALE USB TO 2 FEMALE 3.5 mm JACKS)
  16. PROJECT KALPANA: BUILDING KALPANA: RASPBERRY PI OS
    • RASPBERRY PI OS (FORMERLY RASPBIAN) IS A DEBIAN-BASED OPERATING SYSTEM FOR RASPBERRY PI.
    • RASPBERRY PI OS LOOKS SIMILAR TO MANY COMMON DESKTOPS SUCH AS MACOS AND MICROSOFT WINDOWS.
    • PACKAGES CAN BE INSTALLED VIA APT, THE RECOMMENDED SOFTWARE APP, AND BY USING THE ADD/REMOVE SOFTWARE TOOL, A GUI WRAPPER FOR APT.
  17. PROJECT KALPANA: BUILDING KALPANA: NATURAL LANGUAGE PROCESSING (NLP)
    • IT IS A CONVERGENCE OF ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL LINGUISTICS THAT HANDLES INTERACTIONS BETWEEN MACHINES AND THE NATURAL LANGUAGES OF HUMANS, IN WHICH COMPUTERS ARE REQUIRED TO ANALYZE, UNDERSTAND, ALTER, OR GENERATE NATURAL LANGUAGE.
    • NLP HELPS COMPUTERS TO ENGAGE IN COMMUNICATION USING NATURAL HUMAN LANGUAGE IN MANY FORMS.
  18. PROJECT KALPANA: BUILDING KALPANA: SPEECH TO TEXT
    • SPEECH TO TEXT SOFTWARE, ALSO CALLED AUTOMATIC SPEECH RECOGNITION (ASR) SOFTWARE OR VOICE TO TEXT SOFTWARE, IS A COMPUTER PROGRAM THAT USES LINGUISTIC ALGORITHMS TO SORT AUDITORY SIGNALS AND TRANSFORM THAT INFORMATION INTO WORDS USING UNICODE CHARACTERS.
    • MORE SIMPLY, SPEECH TO TEXT SOFTWARE ‘LISTENS’ TO AUDIO AND DELIVERS AN EDITABLE, VERBATIM TRANSCRIPT.
  19. PROJECT KALPANA: BUILDING KALPANA: HOW DOES SPEECH TO TEXT WORK?
    • THERE ARE MULTIPLE STEPS INVOLVED IN THE PROCESS OF CONVERTING SPEECH INTO TEXT.
    • WHEN YOU’RE TALKING, YOU CREATE A SERIES OF VIBRATIONS. THESE ARE TRANSLATED INTO DIGITAL LANGUAGE BY THE ANALOGUE-TO-DIGITAL CONVERTER (ADC).
  20. PROJECT KALPANA: BUILDING KALPANA: HOW DOES SPEECH TO TEXT WORK?
    • THE ADC COMPLETES THIS CONVERSION BY SAMPLING SOUNDS FROM AN AUDIO FILE AND TAKING FREQUENT, VERY DETAILED MEASUREMENTS OF THE WAVES.
    • THE SYSTEM HAS A FILTER TO DISTINGUISH THE SOUNDS THAT ARE RELEVANT AND DIFFERENTIATE FREQUENCIES. THE SPEED OF THE SPEECH IS ALSO ADJUSTED AND THE VOLUME IS SET TO A CONSTANT LEVEL.
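
As a rough illustration of sampling and of setting the volume to a constant level, the Python sketch below measures a pure sine tone at regular intervals, rounds each measurement to a 16-bit integer, and then rescales the result. The 16000 Hz sample rate, the 16-bit depth, the target peak and the function names are all assumptions made for the example; the slides do not state which values Kalpana uses.

```python
import math


def sample_and_quantize(freq_hz, duration_s, sample_rate=16000, bits=16):
    """Sample a sine tone at regular intervals and round each value to an integer level."""
    max_level = 2 ** (bits - 1) - 1  # 32767 for 16-bit audio
    count = round(duration_s * sample_rate)
    return [
        round(max_level * math.sin(2 * math.pi * freq_hz * n / sample_rate))
        for n in range(count)
    ]


def normalize(samples, target_peak=20000):
    """Scale the samples so that the loudest one reaches target_peak."""
    peak = max(abs(s) for s in samples)
    if peak == 0:
        return list(samples)  # silence: nothing to scale
    return [round(s * target_peak / peak) for s in samples]
```

A 10 ms tone sampled this way yields 160 integer measurements, which is the kind of digital signal the later stages work on.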
  21. PROJECT KALPANA: BUILDING KALPANA: HOW DOES SPEECH TO TEXT WORK?
    • THE NEXT STAGE INVOLVES SEGMENTING THE SIGNAL INTO HUNDREDTHS OR THOUSANDTHS OF A SECOND AND MATCHING THESE PARTS TO PHONEMES (A PHONEME IS A UNIT OF SOUND THAT DISTINGUISHES ONE WORD FROM ANOTHER IN A PARTICULAR LANGUAGE).
    • THERE ARE OVER 40 PHONEMES WITHIN THE ENGLISH LANGUAGE.
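
The segmentation step can be sketched as cutting the list of samples into short, equal-length frames. The 10 ms frame length (one hundredth of a second), the 16000 Hz sample rate and the name `split_into_frames` are assumptions for illustration; matching frames to phonemes is not shown.

```python
def split_into_frames(samples, sample_rate=16000, frame_ms=10):
    """Cut a list of samples into consecutive frames of frame_ms milliseconds."""
    frame_len = sample_rate * frame_ms // 1000  # samples per frame
    # Any leftover samples that do not fill a whole frame are dropped.
    return [
        samples[start:start + frame_len]
        for start in range(0, len(samples) - frame_len + 1, frame_len)
    ]
```

Recognizers often use overlapping frames rather than the back-to-back frames shown here; the simpler form is used only to keep the idea visible.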
  22. PROJECT KALPANA: BUILDING KALPANA: HOW DOES SPEECH TO TEXT WORK?
    • EACH PHONEME IS THEN EXAMINED AND EVALUATED IN RELATION TO THE OTHER PHONEMES AROUND IT, AND THE SYSTEM THEN RUNS THE NETWORK OF PHONEMES THROUGH A COMPLICATED MATHEMATICAL MODEL TO COMPARE THEM TO WELL-KNOWN SENTENCES, INDIVIDUAL WORDS AND PHRASES. USING MACHINE LEARNING, THE SYSTEM THEN CREATES TEXT BASED ON WHAT THE PERSON MOST PROBABLY SAID. THIS IS PRESENTED EITHER AS A CHUNK OF TEXT (TEXT FILE) OR AS A FINAL COMPUTER-BASED COMMAND.
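
The idea of choosing the most probable text can be shown with a toy example. The sketch below is not the mathematical model the slide refers to: it simply multiplies a made-up "how well it matches the sound" score by a made-up "how common the phrase is" score and keeps the best candidate. All names and numbers are invented for illustration.

```python
def most_probable(candidates, acoustic_score, phrase_prior):
    """Pick the candidate with the highest acoustic score times prior probability."""
    return max(
        candidates,
        # Phrases missing from the prior table get a very small default probability.
        key=lambda text: acoustic_score[text] * phrase_prior.get(text, 1e-6),
    )


# Made-up numbers for two phrases that sound alike.
acoustic = {"recognize speech": 0.40, "wreck a nice beach": 0.45}
prior = {"recognize speech": 0.010, "wreck a nice beach": 0.0001}
best = most_probable(list(acoustic), acoustic, prior)
```

Here the second phrase matches the sound slightly better, but the first is far more common, so the combined score favours "recognize speech".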
  23. PROJECT KALPANA: BUILDING KALPANA: LOOKING FOR THE COMMAND IN THE DATABASE
    • KALPANA BREAKS DOWN YOUR “COMMANDS” INTO INDIVIDUAL SOUNDS. IT THEN CONSULTS A DATABASE CONTAINING VARIOUS WORDS OR PRONUNCIATIONS TO FIND WHICH WORDS MOST CLOSELY CORRESPOND TO THE COMBINATION OF INDIVIDUAL SOUNDS.
    • IT THEN IDENTIFIES IMPORTANT WORDS TO MAKE SENSE OF THE TASKS AND CARRY OUT THE CORRESPONDING FUNCTIONS.
    • THE DATABASE SENDS THE INFORMATION BACK TO THE DEVICE AND KALPANA MAY SPEAK.
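
Finding the closest entry in a database can be sketched with Python's standard `difflib` module. Note the substitution: the slide describes matching at the level of sounds, whereas this example compares spelled-out text, which is easier to show. `COMMAND_DB`, `find_command` and the function names stored in the table are hypothetical.

```python
import difflib

# Hypothetical command database: known phrase -> name of the function to run.
COMMAND_DB = {
    "play music": "play_music",
    "tell me the time": "tell_time",
    "what is the weather": "get_weather",
}


def find_command(heard, database=COMMAND_DB, cutoff=0.6):
    """Return the function name for the closest known phrase, or None if nothing is close."""
    matches = difflib.get_close_matches(
        heard.lower(), list(database), n=1, cutoff=cutoff
    )
    return database[matches[0]] if matches else None
```

With this table, a slightly misheard phrase such as "play musik" still resolves to `play_music`, while unrelated input returns `None`.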
  24. PROJECT KALPANA: BUILDING KALPANA: TEXT TO SPEECH
    • TEXT TO SPEECH, OR SPEECH SYNTHESIS, IS THE ARTIFICIAL PRODUCTION OF HUMAN SPEECH.
    • A COMPUTER SYSTEM USED FOR THIS PURPOSE IS CALLED A SPEECH SYNTHESIZER, AND CAN BE IMPLEMENTED IN SOFTWARE OR HARDWARE PRODUCTS.
  25. PROJECT KALPANA: BUILDING KALPANA: HOW DOES TEXT TO SPEECH WORK?
    • A TEXT-TO-SPEECH (TTS) SYSTEM CONVERTS NORMAL LANGUAGE TEXT INTO SPEECH; OTHER SYSTEMS RENDER SYMBOLIC LINGUISTIC REPRESENTATIONS LIKE PHONETIC TRANSCRIPTIONS INTO SPEECH.
    • SYNTHESIZED SPEECH CAN BE CREATED BY CONCATENATING PIECES OF RECORDED SPEECH THAT ARE STORED IN A DATABASE.
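
Concatenating stored recordings can be illustrated with plain lists standing in for audio. The unit names and sample values in `UNIT_DB` are invented, and `concatenate_units` is a hypothetical helper; a real system would store actual recorded waveforms and smooth the joins.

```python
# Hypothetical unit database: each unit maps to a short list of recorded samples.
UNIT_DB = {
    "hel": [0, 3, 5, 3],
    "lo": [0, -2, -4, -2],
}


def concatenate_units(units, database=UNIT_DB):
    """Join stored recordings end to end to form one output waveform."""
    waveform = []
    for unit in units:
        waveform.extend(database[unit])
    return waveform
```

Calling `concatenate_units(["hel", "lo"])` returns the two stored pieces joined in order.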
  26. PROJECT KALPANA: BUILDING KALPANA: HOW DOES TEXT TO SPEECH WORK?
    • A TEXT-TO-SPEECH SYSTEM IS COMPOSED OF TWO PARTS: A FRONT-END AND A BACK-END.
    • THE FRONT-END HAS TWO MAJOR TASKS.
    • FIRST, IT CONVERTS RAW TEXT CONTAINING SYMBOLS LIKE NUMBERS AND ABBREVIATIONS INTO THE EQUIVALENT OF WRITTEN-OUT WORDS. THIS PROCESS IS OFTEN CALLED TEXT NORMALIZATION, PRE-PROCESSING, OR TOKENIZATION.
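
A minimal sketch of text normalization, assuming only a tiny abbreviation table and whole numbers below 100. The tables and the names `number_to_words` and `normalize_text` are illustrative; real front-ends handle far more cases (dates, currency, ordinals and so on).

```python
ABBREVIATIONS = {"dr.": "doctor", "km": "kilometres"}
ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
        "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
        "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n):
    """Spell out an integer from 0 to 99."""
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    return TENS[tens] if ones == 0 else TENS[tens] + " " + ONES[ones]


def normalize_text(text):
    """Expand known abbreviations and small numbers into written-out words."""
    words = []
    for token in text.lower().split():
        if token in ABBREVIATIONS:
            words.append(ABBREVIATIONS[token])
        elif token.isdecimal() and int(token) < 100:
            words.append(number_to_words(int(token)))
        else:
            words.append(token)  # leave everything else unchanged
    return " ".join(words)
```

For instance, "Dr. Rao lives 42 km away" becomes "doctor rao lives forty two kilometres away".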
  27. PROJECT KALPANA: BUILDING KALPANA: HOW DOES TEXT TO SPEECH WORK?
    • THE FRONT-END THEN ASSIGNS PHONETIC TRANSCRIPTIONS TO EACH WORD, AND DIVIDES AND MARKS THE TEXT INTO PROSODIC UNITS, LIKE PHRASES, CLAUSES, AND SENTENCES. THE PROCESS OF ASSIGNING PHONETIC TRANSCRIPTIONS TO WORDS IS CALLED TEXT-TO-PHONEME OR GRAPHEME-TO-PHONEME CONVERSION. PHONETIC TRANSCRIPTIONS AND PROSODY INFORMATION TOGETHER MAKE UP THE SYMBOLIC LINGUISTIC REPRESENTATION THAT IS OUTPUT BY THE FRONT-END.
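
The simplest form of grapheme-to-phoneme conversion is a dictionary lookup, sketched below. The two-word `LEXICON` uses ARPAbet-style symbols and is purely illustrative, as is the `to_phonemes` helper; real systems combine a large pronunciation dictionary with rules or a trained model for unknown words.

```python
# Tiny illustrative lexicon with ARPAbet-style symbols; not a complete dictionary.
LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def to_phonemes(sentence, lexicon=LEXICON):
    """Look up each word; unknown words are kept and flagged with a leading '?'."""
    result = []
    for word in sentence.lower().split():
        result.append(lexicon.get(word, ["?" + word]))
    return result
```

Flagging unknown words instead of guessing keeps the gap visible, so a fallback rule can be added later.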
  28. PROJECT KALPANA: BUILDING KALPANA: HOW DOES TEXT TO SPEECH WORK?
    • THE BACK-END, OFTEN REFERRED TO AS THE SYNTHESIZER, CONVERTS THE SYMBOLIC LINGUISTIC REPRESENTATION INTO SOUND. IN CERTAIN SYSTEMS, THIS PART INCLUDES THE COMPUTATION OF THE TARGET PROSODY (PITCH CONTOUR, PHONEME DURATIONS), WHICH IS THEN IMPOSED ON THE OUTPUT SPEECH.
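
To make "imposing pitch and duration on the output" concrete, the sketch below renders each unit as a plain sine tone with a given pitch and length. This is a deliberate simplification: a sine tone stands in for real speech sound, and the 8000 Hz sample rate, the amplitude and the `synthesize` function are assumptions for the example.

```python
import math


def synthesize(units, sample_rate=8000, amplitude=0.5):
    """Render (pitch_hz, duration_s) pairs as one sine tone per unit."""
    samples = []
    for pitch_hz, duration_s in units:
        count = round(duration_s * sample_rate)  # number of samples for this unit
        for n in range(count):
            samples.append(
                amplitude * math.sin(2 * math.pi * pitch_hz * n / sample_rate)
            )
    return samples
```

For example, `synthesize([(120, 0.1), (150, 0.05)])` produces 1200 samples: 800 at the lower pitch followed by 400 at the higher one.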
  29. PROJECT KALPANA: ADVANTAGES
    • EASY TO USE
    • CAN WORK WITH A VARIETY OF COMMANDS
    • CUSTOM COMMANDS
    • HELPFUL FOR PEOPLE WITH DISABILITIES
  30. PROJECT KALPANA: DISADVANTAGES
    • LIMITED LANGUAGE SUPPORT
    • COSTLY
    • EXPENSIVE EQUIPMENT
    • CANNOT WORK IN NOISY ENVIRONMENTS
  31. CONCLUSION
    • KALPANA IS A DIGITAL AND VIRTUAL ASSISTANT WITH ARTIFICIAL INTELLIGENCE
    • FLEXIBLE AND USEFUL TECHNOLOGY
    • PROVIDES A BETTER INTERFACE FOR INTERACTING WITH IT
  32. BIBLIOGRAPHY
    • https://en.wikipedia.org/wiki/Artificial_intelligence
    • https://en.wikipedia.org/wiki/Virtual_assistant
    • https://en.wikipedia.org/wiki/Amazon_Alexa
    • https://developer.amazon.com/en-US/alexa
    • https://towardsdatascience.com/how-amazon-alexa-works-your-guide-to-natural-language-processing-ai-7506004709d3
    • https://en.wikipedia.org/wiki/Speech_recognition
    • https://en.wikipedia.org/wiki/Speech_synthesis
    • https://en.wikipedia.org/wiki/Raspberry_Pi
    • https://www.raspberrypi.org/software/operating-systems/