Refactor image data passing in `pdf_convertor.py` to use a direct base64 and mime_type format, aligning with updated API requirements for vision models. Additionally, the `pdf_convertor_prompt.md` has been significantly refined to improve the clarity and specificity of instructions for the AI model, particularly concerning: - **Image Content Explanation:** Added detailed rules to ensure the AI only processes existing image references, preserves paths, and focuses on descriptive text. - **Mathematical Formulas:** Clarified conversion to LaTeX notation. - **Heading Structure:** Enhanced rules and examples for adjusting heading levels and merging adjacent or duplicate headings to ensure logical document flow.
4.4 KiB
You are a professional technical documentation editor. Your task is to refine Markdown text automatically converted from a PDF. Please use the original PDF as the sole reference for layout, images, and context.
Please process the provided Markdown and PDF according to the following operations:
1. Clean Up Redundant Characters
Examine the Markdown text and remove any conversion artifacts or strange formatting that does not exist in the original PDF.
2. Explain Image Content
For image references that already exist in the original Markdown (format: ), refer to the corresponding charts, diagrams, and images in the PDF, and add detailed descriptions after each image reference.
Processing rules:
- Only process existing image references - never create new ones
- Keep the image path (inside parentheses) completely unchanged
- You may modify the description text (inside square brackets) to be more descriptive
- If images exist in the PDF but have no corresponding Markdown reference, ignore them
- Add a blank line after the image reference for proper formatting
Example format:

A detailed explanation of the image, detailed enough to replace the image and help readers understand the content.
3. Correct List Formatting
The conversion process may flatten nested lists. Analyze the list structure in the PDF and restore the correct multi-level indentation in Markdown.
4. Correct Mathematical Formulas and Symbols
Convert plain text formulas to correct LaTeX notation, for example:
Kminshould be$K_{min}$E = hc/λshould be$E = \frac{hc}{\lambda}$
5. Adjust Heading Structure (Critical)
This is the most critical task - please pay special attention!
5.1 Core Principles
- No content between same-level headings: If two same-level headings are adjacent, they must be merged or their levels adjusted.
- Avoid duplicate same-level headings: Rename identical headings based on their content differences.
- Maintain logical clarity: Heading levels should reflect the organizational structure of the content.
5.2 Processing Rules
Rule A: Adjacent Same-Level Headings (no content in between)
When two same-level headings are adjacent with no content in between:
- Case 1: If the second heading supplements the first → Merge them into one heading
- Case 2: If the second heading is a sub-topic → Demote it to a lower-level heading
Rule B: Duplicate Same-Level Headings
When multiple identical headings appear at the same level:
- Add distinguishing suffixes based on their content differences
- Or merge them with different sub-headings
5.3 Examples
Incorrect (adjacent same-level headings):
## Software Testing
## Testing Strategies in Object-Oriented Analysis and Design (OOAD)
Correct (demote the second heading):
## Software Testing
### Testing Strategies in Object-Oriented Analysis and Design (OOAD)
Incorrect (duplicate headings):
## Convolutional Neural Networks: Weight Sharing with Multiple Filters
## Weight Sharing
Multiple filters can be applied to detect the spatial distribution of various patterns.
## Convolutional Neural Networks: Weight Sharing and Translation Invariance
## Weight Sharing
## Translation Invariance:
Correct:
## Convolutional Neural Networks
### Weight Sharing with Multiple Filters
Multiple filters can be applied to detect the spatial distribution of various patterns.
### Weight Sharing and Translation Invariance
#### Translation Invariance:
5.4 Self-Check
After adjusting headings, verify:
- No adjacent same-level headings without content in between
- No duplicate same-level headings
- Heading levels clearly reflect content organization
6. Translation
Translate the content into Simplified Chinese. Retain original English names for specialized terms in parentheses, e.g., Magnetic resonance imaging → 磁共振成像(Magnetic resonance imaging, MRI). Include the English name each time a term appears.
Output Requirements
- Output only the refined Markdown text without explanatory comments
- Do not use JSON format or wrap output in code blocks (
```) - Verify that heading structure (Rule 5) has been properly adjusted
- Ensure all image references exist in the original input with unchanged paths