slide-translate

Author	SHA1	Message	Date
nite	26951b8bc0	feat(llm): Add Ollama provider and PyMuPDF image extraction This commit introduces support for Ollama as an alternative Large Language Model (LLM) provider and enhances PDF image extraction capabilities. - Ollama Integration: - Implemented `set_ollama_config` to configure Ollama's base URL from `config.ini`. - Modified `llm.py` to dynamically select and configure the LLM (Gemini or Ollama) based on the `PROVIDER` setting. - Updated `get_model_name` to return provider-specific default model names. - `pdf_convertor.py` now conditionally initializes `ChatGoogleGenerativeAI` or `ChatOllama` based on the configured provider. - PyMuPDF Image Extraction: - Added a new `extract_images_from_pdf` function using PyMuPDF (`fitz`) for direct image extraction from PDF files. - Introduced `get_extract_images_from_pdf_flag` to control this feature via `config.ini`. - `convert_pdf_to_markdown` and `refine_content` functions were updated to utilize this new image extraction method when enabled. - Refinement Flow: - Adjusted the order of `save_md_images` in `main.py` and added an option to save the refined markdown with a specific filename (`index_refined.md`). - Dependencies: - Updated `pyproject.lock` to include new dependencies for Ollama integration (`langchain-ollama`) and PyMuPDF (`PyMuPDF`), along with platform-specific markers for NVIDIA dependencies.	2025-11-11 22:35:23 +11:00
nite	e05c15db16	u	2025-11-07 04:03:57 +11:00
nite	3eef042111	refactor(app): Extract PDF conversion logic into a separate module The main.py script was becoming monolithic, containing all the logic for PDF conversion, image path simplification, and content refinement. This change extracts these core functionalities into a new `pdf_convertor` module. This refactoring improves the project structure by: - Enhancing modularity and separation of concerns. - Making the main.py script a cleaner, high-level orchestrator. - Improving code readability and maintainability. The functions `convert_pdf_to_markdown`, `save_md_images`, and `refine_content` are now imported from the `pdf_convertor` module and called from the main execution block.	2025-10-27 20:02:02 +11:00
nite	4f29d5c814	feat(llm): Send images to model and enhance processing prompt	2025-10-25 22:51:54 +11:00
nite	37d4facee3	feat: Enable batch processing of PDF files and update README	2025-10-22 20:56:17 +11:00
nite	ad212a35af	init	2025-10-22 17:10:29 +11:00

6 Commits