Commit Graph

305 Commits

Author SHA1 Message Date
Iván Martínez 5943ad1bf7
Merge pull request #659 from doskoi/patch-2
Skip for empty query
2023-06-11 19:13:58 +02:00
Iván Martínez b2b5fd4298
Merge pull request #675 from nb-programmer/main
Update README.md instructions of .env file
2023-06-11 19:11:09 +02:00
Iván Martínez 51fa989679
Merge pull request #660 from doskoi/master
Improving performance for PDF loader
2023-06-11 19:10:08 +02:00
imaprogrammer c4b247d696
Update README.md instructions of .env file
Clarified to create a copy of example.env instead of renaming it to prevent accidentally removing from repo
2023-06-10 10:25:50 +05:30
sj 05c7330643 Enhancement better performance for PDF loader 2023-06-07 23:51:05 +08:00
Jiang Sheng ddfb95a32e
Skip for empty query 2023-06-07 23:29:55 +08:00
Iván Martínez 9d47d03d18
Merge pull request #560 from ravindraprasad75/fix-csv-issue
fixed the the csv file reading issue
2023-06-01 10:25:37 +02:00
Ravi e9b31f7dd9
Update ingest.py
Co-authored-by: Bailey Matthews <bailey@hey.com>
2023-05-31 22:42:10 +05:30
Ravindra Prasad db341e2a40 fixed the the csv file reading issue 2023-05-31 00:04:56 +05:30
Iván Martínez 60e6bd25eb
Merge pull request #474 from maozdemir/patch-3
fix: Add `TARGET_SOURCE_CHUNKS` to `example.env`
2023-05-25 12:44:03 +02:00
maozdemir 2027ac563b
fix: Add `TARGET_SOURCE_CHUNKS` to `example.env`
@imartinez
2023-05-25 13:36:43 +03:00
Iván Martínez e6d6af4f82
Merge pull request #460 from maozdemir/feat/documents
feat: Get answers using preferred number of chunks
2023-05-25 08:26:20 +02:00
impulsivus cf709a6b7a
feat: Get answers using preferred number of chunks 2023-05-24 21:16:58 +03:00
Iván Martínez 573c4363c4 Update LangChain to 0.0.177 and GPT4ALL bindings library 2023-05-23 12:42:27 +02:00
Iván Martínez fb94b9d1d4
Merge pull request #387 from maozdemir/patch-1
typo: Change `pip` to `pip3` in README.md
2023-05-22 23:23:29 +02:00
maozdemir 6065918d0f
typo: Change `pip` to `pip3` 2023-05-22 19:04:01 +03:00
Iván Martínez 2f3aab9cfd Formatting fixes 2023-05-20 12:29:36 +02:00
Iván Martínez e74a11119c Show ingestion logs in readme 2023-05-20 12:15:13 +02:00
Iván Martínez 80b9b1d03e Better logs during ingestion 2023-05-20 12:11:21 +02:00
Iván Martínez 4a0e0d2e70 Use chunk_size variable in logs. Make vectorstore check more flexible 2023-05-20 12:02:40 +02:00
Iván Martínez fca1128fba Merge branch 'maozdemir-main' 2023-05-20 11:49:15 +02:00
Iván Martínez 7180d4386b Merge branch 'main' of https://github.com/maozdemir/privateGPT into maozdemir-main 2023-05-20 11:48:29 +02:00
Iván Martínez a86641cdec Readme small fixes following review and formatting 2023-05-20 11:22:45 +02:00
Iván Martínez fc50eb1b89 Merge branch 'abhiruka-main' 2023-05-20 11:21:35 +02:00
jiangzhuo cb7c96b31d Add progress bar to load_documents function
Enhanced the load_documents() function by adding a progress bar using the tqdm library. This change improves user experience by providing real-time feedback on the progress of document loading. Now, users can easily track the progress of this operation, especially when loading a large number of documents.
2023-05-20 11:16:13 +02:00
jiangzhuo e3b769d33a Optimize load_documents function with multiprocessing 2023-05-20 11:16:13 +02:00
MDW 04f6706bbb Make scripts executeable, add basic pre-commit setup 2023-05-20 11:15:58 +02:00
Iván Martínez 20554a7c9d
Merge pull request #292 from jiangzhuo/feature/multiprocessing-for-document-loading
Optimize load_documents function with multiprocessing
2023-05-20 10:57:42 +02:00
Iván Martínez b30cd52136
Merge pull request #271 from mdeweerd/executable_python
Make scripts executable, add basic pre-commit setup
2023-05-20 10:49:20 +02:00
Abhiruka be1bcbca37
Merge branch 'imartinez:main' into main 2023-05-20 07:42:26 +08:00
abhiruka f8805c80f8 Update as per the feedback.
- moved args parser inside main
- assigned empty list to docs.
- Updated README.md.
2023-05-20 07:40:05 +08:00
MDW 7f918a9fa1 Make scripts executeable, add basic pre-commit setup 2023-05-19 23:21:39 +02:00
Iván Martínez 22945bc91d
Merge pull request #299 from mdeweerd/elm_extended
Add fallback for plain elm #294 #290
2023-05-19 21:40:42 +02:00
abhiruka 9fb7f07e3c "Refactored main function to take hide_source and mute_stream parameters for controlling output. Added argparse for command-line argument parsing. StreamingStdOutCallbackHandler and source document display are now optional based on user input. Introduced parse_arguments function to handle command-line arguments. Also, updated README.md to reflect these changes." 2023-05-19 23:18:31 +08:00
MDW 4cda348cf8 Fix #294 (tested) 2023-05-19 16:23:09 +02:00
jiangzhuo ba0dbe8d1c Add progress bar to load_documents function
Enhanced the load_documents() function by adding a progress bar using the tqdm library. This change improves user experience by providing real-time feedback on the progress of document loading. Now, users can easily track the progress of this operation, especially when loading a large number of documents.
2023-05-19 10:59:38 +09:00
jiangzhuo 81b221bccb Optimize load_documents function with multiprocessing 2023-05-19 10:58:28 +09:00
MDW a862ff2be6 Add fallback for plain elm #294 #290 2023-05-19 01:04:42 +02:00
Iván Martínez ad64589c8f
Merge pull request #231 from milescattini/patch-1
Add fix for clang install of non m1 mac
2023-05-18 23:51:36 +02:00
Iván Martínez b9f8dc312f
Merge pull request #254 from Fabio3rs/formatOffice97-2003
Add .doc .ppt (Word and PowerPoint 97/2003 formats)
2023-05-18 23:49:40 +02:00
Iván Martínez 1590c5890f Update requirements 2023-05-18 23:23:11 +02:00
impulsivus 7844553ca1
Implement a way of ingesting more documents
Move environment variables to the global scope
Add a better check for vectorstore existence
Introduced a new function for better readability
Co-authored-by: Pulp <51127079+PulpCattel@users.noreply.github.com>
2023-05-18 17:45:38 +03:00
Iván Martínez 42046c5ec0
Merge pull request #268 from vilaca/dotenv-called-twice
remove duplicate call 'load_dotenv()' in ingester.py
2023-05-18 15:15:17 +02:00
milescattini 2360728fab
Fix Typo in Mac on Intel 2023-05-18 18:02:54 +10:00
Fabio Rossini Sluzala ec126b51d8
Fix loader mapping order 2023-05-17 22:38:30 -03:00
vilaca 79a3c00313 remove duplicate 2023-05-17 23:45:27 +01:00
Fabio Rossini Sluzala 652401cf29
Add the formats to the README.md 2023-05-17 13:53:46 -03:00
Fabio Rossini Sluzala 66a9f9cde0
Add .doc .ppt (Word and PowerPoint 97/2003 formats) 2023-05-17 12:04:16 -03:00
Iván Martínez 355b4be7c0
Merge pull request #224 from imartinez/feature/sentence-transformers-embeddings
Feature/sentence transformers embeddings
2023-05-17 10:56:34 +02:00
Iván Martínez 83797ec08b
Merge pull request #240 from zishon89us/patch-1
pypandoc-binary replacing pandoc-binary
2023-05-17 09:25:14 +02:00