International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 11 Issue: 10 | Oct 2024
p-ISSN: 2395-0072
www.irjet.net
Desktop Personal Voice Assistant using ML and NLP Mrs S. Niveditha1, B. Jaswanth Reddy2, B. Sai Charan Reddy3, Hitendra Singh Shekhawat4 Asst. Professor1 , Student2 , Student3 , Student4 Department of Computer Science and Engineering1, SRM Institute of Science and Technology, Chennai, India1 --------------------------------------------------------------------------***----------------------------------------------------------------------Abstract— The voice assistant project presents a powerful and user-centric tool designed to streamline daily activities through the seamless integration of advanced natural language processing (NLP) and speech technologies. Equipped with multilingual capabilities, it supports English, Hindi, Telugu, and Tamil, making it accessible to a diverse user base. This versatile assistant offers a wide array of functionalities aimed at improving both personal and professional productivity. Users can effortlessly access real-time news searches, manage emails, receive weather updates, and browse the web—all through voice commands. By leveraging OpenAI’s API for speech-to-text (STT) and text-to-speech (TTS) capabilities, the assistant can accurately interpret user inputs and execute tasks with precision. Its conversational abilities enhance the user experience by allowing for smooth interactions, ensuring that the assistant is highly responsive and adaptable to a range of requests.
Hands-free Operations, Conversational Personalized Content Delivery
I. INTRODUCTION Voice assistants have become indispensable in modern life, offering a seamless way to interact with technology using voice commands. This project focuses on creating a robust, multilingual voice assistant aimed at simplifying both personal and professional workflows. As speech recognition and artificial intelligence (AI) technologies continue to evolve, voice assistants have progressed from basic command-driven systems to intelligent entities capable of understanding context and performing a range of tasks in real time. The assistant in this project is designed to meet diverse user needs by enabling tasks such as web browsing, email management, weather updates, and news summaries, all through natural, conversational speech. Its multilingual capability ensures that users can communicate in their preferred language, making it inclusive and adaptable to varied demographics.
Beyond task execution, this assistant goes a step further in offering personalized services that enhance convenience. It provides features such as reading and summarizing emails, setting reminders, and generating daily news summaries, which can be delivered via email for easy access. This voice assistant is a step toward enabling hands-free operations, empowering users with a tool that simplifies multitasking and delivers information seamlessly. Whether it's keeping up with important communications or staying informed with the latest news, the assistant’s intuitive design ensures a smooth, efficient user experience. Its combination of voice command versatility and personalized content delivery makes it an indispensable tool for anyone looking to enhance their workflow and stay connected in an increasingly digital world.
At the heart of this voice assistant is the drive to enhance productivity and accessibility. Users can speak to the assistant to manage their daily tasks without needing to manually navigate through multiple apps. For instance, it offers the ability to summarize emails, saving users time and ensuring they stay informed about important communications. Whether it's searching the web for quick information or providing detailed summaries of weather forecasts, the assistant handles these tasks efficiently. The assistant’s web search functionality is particularly useful for research or quick lookups, offering concise results in a matter of seconds. It makes staying updated with daily tasks more manageable, empowering users with a handsfree, voice-driven experience that fits into their busy lives. Leveraging cutting-edge technologies such as OpenAI’s language models and Whisper for speech-to-text (STT), the voice assistant ensures highly accurate and fluid interactions. Its text-to-speech (TTS) functionality delivers responses in a natural, conversational tone, making interactions more human-like. A standout feature is its ability to generate daily news summaries across various
Keywords — Voice Assistant, Natural Language Processing (NLP), Multilingual Support, OpenAI API, Speech-to-Text (STT), Text-to-Speech (TTS), Email Management, Weather Updates, News Summaries, Web Browsing, Task Automation, Personal Assistant,
© 2024, IRJET
|
Impact Factor value: 8.315
AI,
|
ISO 9001:2008 Certified Journal
|
Page 732