slide-translate/pdf_convertor.py at 0e4a609c93fb3498193fb1a9a94aa39706875b91

Files

nite e8fa2617ba feat: Update image handling and refine AI prompt instructions

Refactor image data passing in `pdf_convertor.py` to use a direct base64 and mime_type format, aligning with updated API requirements for vision models.

Additionally, the `pdf_convertor_prompt.md` has been significantly refined to improve the clarity and specificity of instructions for the AI model, particularly concerning:
- **Image Content Explanation:** Added detailed rules to ensure the AI only processes existing image references, preserves paths, and focuses on descriptive text.
- **Mathematical Formulas:** Clarified conversion to LaTeX notation.
- **Heading Structure:** Enhanced rules and examples for adjusting heading levels and merging adjacent or duplicate headings to ensure logical document flow.

2025-11-12 18:05:24 +11:00

8.1 KiB

Executable File

Raw Blame History

View Raw

8.1 KiB Executable File Raw Blame History

8.1 KiB

Executable File

Raw Blame History