feat(minimax-pdf-read): add PDF reading skill for text extraction #51
Open
TumCucTom wants to merge 2 commits into MiniMax-AI:main
divitkashyap added a commit to divitkashyap/skills that referenced this pull request (Apr 6, 2026)
…ability

Submitted by: https://github.com/divitkashyap

## What

Added a skill that provides automatic fallback when an agent states it cannot read PDFs. It intercepts the limitation and extracts PDF text using command-line tools, with user confirmation for installation.

## Why

When AI agents lack native PDF reading capability, they either fail or give generic responses. This skill provides a complete fallback workflow:

1. Detect when the agent says 'I cannot read PDFs' (or similar phrases)
2. Check for available tools (pdftotext → pdfplumber → pymupdf)
3. If no tool is found, ask the user for permission to install one
4. Extract PDF text to a temp file
5. Continue with the original user task

## Complementary to minimax-pdf-read

This skill differs from the one in PR MiniMax-AI#51:

- minimax-pdf-read: User explicitly asks to extract text from a PDF (active)
- pdf-reader: Agent says it can't read PDFs → fallback workflow (passive)

Both can coexist; they serve different trigger conditions.

## Tool Priority

1. pdftotext (poppler-utils): preferred, fastest, system-level
2. pdfplumber (Python): fallback if poppler is not available
3. pymupdf (Python): alternative Python fallback

## Platform Support

- macOS: Homebrew (brew install poppler) or pip
- Linux: apt-get/dnf install poppler-utils or pip
- Windows: winget/chocolatey or pip

## Key Features

- Automatic detection of agent PDF limitation
- Multi-tool fallback strategy
- User confirmation before installation
- Platform-specific installation commands
- Layout preservation (-layout flag)
- Page range extraction (-f, -l flags)
- Error handling for encrypted/protected PDFs

## Example Triggers

- 'I cannot read PDFs'
- 'I don't have the ability to read PDFs'
- 'I can't access PDF content'

## Files

- skills/pdf-reader/SKILL.md: complete skill with workflow
- README.md, README_zh.md: updated with new entry

## Validation

All 15 skills pass: python .claude/skills/pr-review/scripts/validate_skills.py ✅
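For reviewers, the tool priority and the pdftotext flags listed above could be sketched roughly as follows. This is a minimal illustration, not the code in skills/pdf-reader: the function names `pick_tool` and `pdftotext_command` and the injectable probe callbacks are hypothetical, and the check for a `fitz` module is an assumption based on PyMuPDF's older import name.

```python
import importlib.util
import shutil
from typing import Callable, List, Optional


def _has_binary(name: str) -> bool:
    """True if an executable called `name` is on PATH."""
    return shutil.which(name) is not None


def _has_module(name: str) -> bool:
    """True if a top-level Python module called `name` can be found."""
    return importlib.util.find_spec(name) is not None


def pick_tool(
    has_binary: Callable[[str], bool] = _has_binary,
    has_module: Callable[[str], bool] = _has_module,
) -> Optional[str]:
    """Return the first available tool in the stated priority order, or None."""
    if has_binary("pdftotext"):
        return "pdftotext"
    if has_module("pdfplumber"):
        return "pdfplumber"
    # Assumption: PyMuPDF may be importable as `pymupdf` or as the older `fitz`.
    if has_module("pymupdf") or has_module("fitz"):
        return "pymupdf"
    return None


def pdftotext_command(
    pdf_path: str,
    out_path: str,
    layout: bool = True,
    first_page: Optional[int] = None,
    last_page: Optional[int] = None,
) -> List[str]:
    """Build a pdftotext argument list using the -layout, -f and -l flags."""
    cmd = ["pdftotext"]
    if layout:
        cmd.append("-layout")  # keep the original physical layout
    if first_page is not None:
        cmd += ["-f", str(first_page)]  # first page to extract
    if last_page is not None:
        cmd += ["-l", str(last_page)]  # last page to extract
    cmd += [pdf_path, out_path]
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, check=True)`. Passing the probes as parameters keeps the priority logic testable on machines where none of the tools are installed.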
divitkashyap added a commit to divitkashyap/skills that referenced this pull request (Apr 6, 2026)
Submitted by: https://github.com/divitkashyap

## What

Added a skill that provides automatic PDF text extraction fallback using command-line tools (pdftotext/poppler-utils), with optional installation and user confirmation.

## Why

When a user shares a PDF or asks to read or extract text from it, and the agent lacks native PDF capability, this skill provides a complete fallback workflow:

1. Detect a PDF file in the user's message
2. Check for available tools (pdftotext → pdfplumber → pymupdf)
3. If no tool is found, ask the user for permission to install one
4. Extract PDF text to a temp file
5. Continue with the original user task

## Complementary to minimax-pdf-read (PR MiniMax-AI#51)

This skill differs from the skill in that PR:

- minimax-pdf-read: User explicitly asks to extract text from a PDF (active)
- pdf-reader: Fallback when the agent needs to process a PDF but lacks the capability

Both can coexist; they serve different use cases.

## Tool Priority

1. pdftotext (poppler-utils): preferred, fastest, system-level
2. pdfplumber (Python): fallback if poppler is not available
3. pymupdf (Python): alternative Python fallback

## Platform Support

- macOS: Homebrew (brew install poppler) or pip
- Linux: apt-get/dnf install poppler-utils or pip
- Windows: winget/chocolatey or pip

## Validation

All 15 skills pass: python .claude/skills/pr-review/scripts/validate_skills.py ✅
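The "ask user permission to install" step and the platform list above might look something like this. It is a sketch under stated assumptions: `install_options` and `confirm_install` are hypothetical names, the prompt wording is invented for illustration, and since the commit message gives no winget or chocolatey package names, Windows is shown with the pip options only.

```python
import platform
from typing import Callable, List

# pip fallbacks available on every platform.
PIP_OPTIONS = ["pip install pdfplumber", "pip install pymupdf"]

# System package commands named in the commit message, keyed by platform.system().
# Assumption: Windows is left empty because no winget/chocolatey package name is given.
SYSTEM_OPTIONS = {
    "Darwin": ["brew install poppler"],
    "Linux": ["apt-get install poppler-utils", "dnf install poppler-utils"],
    "Windows": [],
}


def install_options(system: str = "") -> List[str]:
    """List candidate install commands for the given (or current) platform."""
    system = system or platform.system()
    return SYSTEM_OPTIONS.get(system, []) + PIP_OPTIONS


def confirm_install(command: str, ask: Callable[[str], str] = input) -> bool:
    """Ask before installing anything; only an explicit yes counts as consent."""
    answer = ask(f"No PDF tool found. Run `{command}`? [y/N] ")
    return answer.strip().lower() in {"y", "yes"}
```

Defaulting to "no" on an empty answer matches the intent that nothing is installed without explicit user confirmation.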
## Summary

- `minimax-pdf-read` skill for extracting text and metadata from PDF documents
- Complements the existing `minimax-pdf`, which handles PDF generation/editing

## What

- `skills/minimax-pdf-read/SKILL.md`: skill entry with trigger conditions and usage docs
- `skills/minimax-pdf-read/scripts/extract.py`: CLI tool with shebang, error handling, and CLI args
- `skills/minimax-pdf-read/scripts/requirements.txt`: pypdf dependency
- `README.md` and `README_zh.md`: updated with new skill entry

## Why

The existing `minimax-pdf` skill handles CREATE/FILL/REFORMAT: generating and editing PDFs. There was no skill for reading and extracting text from existing PDFs. This fills that gap.

## Validation

`python .claude/skills/pr-review/scripts/validate_skills.py` passes
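
For readers without the diff open, a pypdf-based extraction CLI of the kind described here could be shaped roughly like this. It is not the contents of `extract.py`: the `--metadata` flag, the function names, and the error messages are assumptions for illustration. Only `PdfReader`, `is_encrypted`, `metadata`, `pages`, and `extract_text()` are taken from pypdf itself.

```python
#!/usr/bin/env python3
import argparse
import sys
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical CLI: one positional PDF path plus an optional metadata flag."""
    parser = argparse.ArgumentParser(description="Extract text and metadata from a PDF.")
    parser.add_argument("pdf", help="path to the PDF file")
    parser.add_argument("--metadata", action="store_true", help="print document metadata first")
    return parser


def extract(path: str, with_metadata: bool = False) -> str:
    """Return the text of every page, optionally preceded by metadata lines."""
    # Imported lazily so argument parsing works even before pypdf is installed.
    from pypdf import PdfReader

    reader = PdfReader(path)
    if reader.is_encrypted:
        raise ValueError(f"{path} is encrypted; a password is required")
    parts: List[str] = []
    if with_metadata and reader.metadata:
        parts += [f"{key}: {value}" for key, value in reader.metadata.items()]
    for page in reader.pages:
        parts.append(page.extract_text() or "")
    return "\n".join(parts)


def main(argv: Optional[List[str]] = None) -> int:
    """Run the CLI; return 0 on success and 1 on any read or parse failure."""
    args = build_parser().parse_args(argv)
    try:
        print(extract(args.pdf, args.metadata))
    except Exception as exc:
        print(f"error: {exc}", file=sys.stderr)
        return 1
    return 0
```

As a script, this would end with `sys.exit(main())` under a `__main__` guard. Returning an exit code from `main` instead of raising keeps the error handling in one place.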