Read the article to learn how it works: Medium Article
With File GPT you can extract the information from a file: you get the transcription, an embedding for each text segment, and a chat interface for asking questions about the file.
All code was written with the help of Code GPT.
- Read any PDF, DOCX, TXT, or CSV file
- Embed text segments with LangChain and OpenAI (text-embedding-ada-002)
- Chat with the file using streamlit-chat and LangChain question answering with sources (text-davinci-003)
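The features above follow a common pattern: split the text into segments, embed each segment, then retrieve the segments closest to a question and pass them to the chat model as sources. The sketch below illustrates that pattern only and is not the repository's code. The function names (`split_into_segments`, `toy_embed`, `top_segments`) are hypothetical, and `toy_embed` is a bag-of-words stand-in used here so the example runs offline; the app itself uses text-embedding-ada-002 for embeddings.

```python
import math
import re
from collections import Counter


def split_into_segments(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into segments of `size` words that overlap by `overlap` words."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    step = size - overlap
    segments = []
    for start in range(0, len(words), step):
        segments.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return segments


def toy_embed(text: str) -> Counter:
    """ASSUMPTION: bag-of-words counts standing in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def top_segments(question: str, segments: list[str], k: int = 2) -> list[tuple[int, str]]:
    """Return the k (index, segment) pairs most similar to the question."""
    q = toy_embed(question)
    scored = [(cosine(q, toy_embed(s)), i, s) for i, s in enumerate(segments)]
    scored.sort(key=lambda item: (-item[0], item[1]))  # best score first, ties by index
    return [(i, s) for _, i, s in scored[:k]]


if __name__ == "__main__":
    segments = [
        "The invoice total is 120 euros.",
        "Cats sleep most of the day.",
        "Payment is due in thirty days.",
    ]
    for index, segment in top_segments("What is the invoice total?", segments, k=1):
        print(index, segment)
```

The returned index acts as the "source" for an answer: it tells the reader which segment of the file the reply was drawn from.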
- Clone the repository
git clone https://github.com/davila7/file-gpt
cd file-gpt
- Install dependencies
The requirements.txt file installs the following dependencies:
- openai
- pypdf
- scikit-learn
- numpy
- tiktoken
- docx2txt
- langchain
- pydantic
- typing
- faiss-cpu
- streamlit_chat
pip install -r requirements.txt
- Run the Streamlit server
streamlit run app.py
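If the server fails to start, it can help to confirm that the packages from requirements.txt are importable. The script below is a minimal sketch for that check and is not part of the repository; the helper name `missing_packages` is hypothetical. It maps each listed package to its import name, since a few differ (scikit-learn imports as `sklearn`, faiss-cpu as `faiss`), and it skips `typing`, which ships with Python.

```python
from importlib.util import find_spec

# Package name from the dependency list -> module name used in `import`.
IMPORT_NAMES = {
    "openai": "openai",
    "pypdf": "pypdf",
    "scikit-learn": "sklearn",
    "numpy": "numpy",
    "tiktoken": "tiktoken",
    "docx2txt": "docx2txt",
    "langchain": "langchain",
    "pydantic": "pydantic",
    "faiss-cpu": "faiss",
    "streamlit_chat": "streamlit_chat",
}


def missing_packages(import_names: dict[str, str]) -> list[str]:
    """Return the package names whose module cannot be found in this environment."""
    return [pkg for pkg, module in import_names.items() if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages(IMPORT_NAMES)
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

Run it with the same Python interpreter (or virtual environment) used for `pip install`, otherwise the result may not reflect what Streamlit sees.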