Docudetective (2) (1) Final
Docudetective (2) (1) Final
Docudetective (2) (1) Final
INTRODUCTION
Chapter 1
INTRODUCTION
1.1 Overview
The project involves developing a chatbot that provides answers by
referencing a PDF document uploaded by the user. The system utilizes Python, Streamlit,
and LangChain to create a seamless user experience. The Streamlit module is employed for
building a user-friendly interface, enabling users to upload PDF documents. LangChain is
integrated to process natural language queries and extract relevant information from the
uploaded PDF. In cases where the chatbot cannot find an answer within the provided
document, it dynamically searches the internet for relevant information. This
comprehensive approach ensures that users receive accurate and extensive responses,
combining local document knowledge with real-time internet-based information retrieval.
The synergistic use of Streamlit and LangChain enhances the user interface and natural
language processing capabilities, respectively, providing a robust and interactive solution
for answering queries based on both local and online resources.
1.5 Motivation
Motivation for this project stems from the desire to enhance user accessibility to
information within a seamless conversational interface. By developing a chatbot capable of
referencing uploaded PDF files, we empower users to effortlessly retrieve specific
information. Moreover, integrating an internet search functionality ensures a
comprehensive knowledge base, ensuring users receive accurate and up-to-date responses
even when the information is not present in the uploaded documents. This innovative
approach aligns with the evolving needs of users, offering a dynamic and efficient solution
for information retrieval.
1.6 Objectives
Nagendra prasad Krishnam, Dr. Ashim Bora et al., [2], It proposes a development of a
multilingual talkbot, powered by AI and NLP technologies, to address the high demand for
student inquiries in both English and Arabic. This talkbot is designed to provide quick and
accurate responses to questions related to college policies, academic processes, and
extracurriculars. Absence of an extensive discussion regarding the scalability, real-world
implementation challenges, and adaptability to diverse and evolving educational
environments.
C Kavitha, K Pavan., [3], It envison and creation of chatbot framework that attempts to
improve the interest skill of chatbot by intelligently recognizing and gathering lacking data
from the user that are required for answer generation. There is lack of detailed exploration
into the practical implementation also following a gap in the research because we need
more real-world testing and results to understand how effective this chatbot system.
Asha, Nithya, Durgaprasad, Kiran., [4], It proposes an algorithm to train a chatbot that
helps giving instantaneous answers for the users query using Natural language processing,
deep learning. It also suggests that we can improve the machine by implementing
Recurrent Neural Network. It focuses on the technical aspects of chatbot development and
their applications but does not mention the ethical and privacy considerations associated
with using chatbots.
Guru kiran,Angad Pal, Rishi J, Saritha K., [6], It proposes a practical application
implementing a Chatbot in PES University which is used to resolve the queries of students
and guest regarding the college management, the courses and general queries related to the
college.But the model can answer only predefined questions and no out of the box
question can be asked.
Ana Rodriguez, Rodrigo mejia, Karen Cornejo., [7], It proposes an algorithm for
Implementation of Business Process Outsourcing to phrase multiple types of input
including text and audio.The main objective is to help client grow economically through
optimization of time and resource. But it can be used only on CSV file which is used to
understand the financial growth of the business and give predictive answers.
Sardar Jaf, Kenneth McGarry., [9], It gives detailed explanations on the recent
advancement in Chatbot in terms of language model, applications, dataset used and
evaluation framework.
Julija Skrebeca, Paula Kalniete, Janis Goldbergs., [10], Advocates the widespread use
of chatbots, particularly during challenging times like the COVID-19 pandemic. These
chatbots, powered by artificial intelligence, are proposed to serve a dual purpose: as
interactive teaching systems in education and as tools to e-commerce processes. It lacks a
comprehensive exploration of chatbots' potential to bridge the domains of education and
Munira Ansari, Talha Khan, Anupam Singh., [13], This is a theoretical papers, which
tells us all the ways of how to implement web based chatbot and compares different
technique by giving all the pros and cons. In this cloud storage plays a vital role and might
be expensive if not used properly.
A use case diagram is crucial in systems analysis and design as it illustrates the
interactions between system components and external entities, showcasing how users
interact with a system. It helps identify and define system functionalities, requirements, and
user roles, providing a high-level view of the system's behavior. This diagram serves as a
communication tool between stakeholders, aiding in the understanding of system
functionality and ensuring alignment with user needs and business goals.
SYSTEM ARCHITECTURE
Modules:
API calls:
• Client responsibility: The client will send a GET request to the web api with the
question as a URL parameter. The client will upload the pdf and then ask the
question in a single line.
• Server responsibility: The server will send all the API data in JSON response
documents. The sever will respond with 200 OK status code if the request is found
in the database. The server will respond with 400 BAD status code if the request is
not found in the document and will reach out to SERP API.
Software Requirement
Hardware Requirement
APPLICATIONS
1. Customer Support:
• Users can upload product manuals or guides, and the chatbot can help them find
information or troubleshoot issues.
• The chatbot can refer to the uploaded documentation to provide step-by-step instructions
or solutions.
2. Educational Assistance:
• Students can upload lecture notes, textbooks, or research papers, and the chatbot can
help them with questions related to the material.
• The chatbot can serve as a study aid, providing explanations or additional information
based on the uploaded documents.
3. Technical Documentation:
• In a professional setting, employees can upload technical manuals or documentation,
and the chatbot can assist with queries related to procedures or specifications.
• It can help streamline the process of accessing and understanding complex technical
information.
4. Legal Assistance:
• Users can upload legal documents, and the chatbot can provide information on legal
terms, processes, or general advice.
• The chatbot can guide users through legal documents and offer explanations or
interpretations.
5. Research Support:
• Researchers can use the chatbot to quickly access relevant information from a large set
of documents.
• The chatbot can provide summaries, key points, or references from the uploaded
documents.
6. Healthcare Information:
• Patients can upload medical documents, and the chatbot can provide explanations of
medical terms, procedures, or general health information.
• The chatbot can also direct users to reliable online health resources.