Skip to content

feat(minimax-pdf-read): add PDF reading skill for text extraction#51

Open
TumCucTom wants to merge 2 commits intoMiniMax-AI:mainfrom
TumCucTom:feat/minimax-pdf-read
Open

feat(minimax-pdf-read): add PDF reading skill for text extraction#51
TumCucTom wants to merge 2 commits intoMiniMax-AI:mainfrom
TumCucTom:feat/minimax-pdf-read

Conversation

@TumCucTom
Copy link
Copy Markdown

Summary

  • Add minimax-pdf-read skill for extracting text and metadata from PDF documents
  • Uses pypdf for local text extraction (no API keys required)
  • Complements existing minimax-pdf which handles PDF generation/editing

What

  • skills/minimax-pdf-read/SKILL.md — skill entry with trigger conditions and usage docs
  • skills/minimax-pdf-read/scripts/extract.py — CLI tool with shebang, error handling, and CLI args
  • skills/minimax-pdf-read/scripts/requirements.txt — pypdf dependency
  • Updated README.md and README_zh.md with new skill entry

Why

The existing minimax-pdf skill handles CREATE/FILL/REFORMAT — generating and editing PDFs. There was no skill for reading and extracting text from existing PDFs. This fills that gap.

Validation

  • python .claude/skills/pr-review/scripts/validate_skills.py passes

divitkashyap added a commit to divitkashyap/skills that referenced this pull request Apr 6, 2026
…ability

Submitted by: https://github.com/divitkashyap

## What
Added  — a skill that provides automatic fallback when an agent states it cannot read PDFs. Intercepts the limitation and extracts PDF text using command-line tools with user confirmation for installation.

## Why
When AI agents lack native PDF reading capability, they either fail or give generic responses. This skill provides a complete fallback workflow:
1. Detect when agent says 'I cannot read PDFs' (or similar phrases)
2. Check for available tools (pdftotext → pdfplumber → pymupdf)
3. If no tool found, ask user permission to install
4. Extract PDF text to temp file
5. Continue with original user task

## Complementary to minimax-pdf-read
This skill differs from  (PR MiniMax-AI#51):
- minimax-pdf-read: User explicitly asks to extract text from a PDF (active)
- pdf-reader: Agent says it can't read PDFs → fallback workflow (passive)

Both can coexist — they serve different trigger conditions.

## Tool Priority
1. pdftotext (poppler-utils) — Preferred, fastest, system-level
2. pdfplumber (Python) — Fallback if poppler not available
3. pymupdf (Python) — Alternative Python fallback

## Platform Support
- macOS: Homebrew (brew install poppler) or pip
- Linux: apt-get/dnf install poppler-utils or pip
- Windows: winget/chocolatey or pip

## Key Features
- Automatic detection of agent PDF limitation
- Multi-tool fallback strategy
- User confirmation before installation
- Platform-specific installation commands
- Layout preservation (-layout flag)
- Page range extraction (-f, -l flags)
- Error handling for encrypted/protected PDFs

## Example Triggers
- 'I cannot read PDFs'
- 'I don't have the ability to read PDFs'
- 'I can't access PDF content'

## Files
- skills/pdf-reader/SKILL.md — Complete skill with workflow
- README.md, README_zh.md — Updated with new entry

## Validation
All 15 skills pass: python .claude/skills/pr-review/scripts/validate_skills.py ✅
divitkashyap added a commit to divitkashyap/skills that referenced this pull request Apr 6, 2026
Submitted by: https://github.com/divitkashyap

## What
Added  — a skill that provides automatic PDF text extraction fallback using command-line tools (pdftotext/poppler-utils) with optional installation and user confirmation.

## Why
When user shares a PDF or asks to read/extract text from it, and the agent lacks native PDF capability, this skill provides a complete fallback workflow:
1. Detect PDF file in user's message
2. Check for available tools (pdftotext → pdfplumber → pymupdf)
3. If no tool found, ask user permission to install
4. Extract PDF text to temp file
5. Continue with original user task

## Complementary to minimax-pdf-read (PR MiniMax-AI#51)
This skill differs from :
- minimax-pdf-read: User explicitly asks to extract text from a PDF (active)
- pdf-reader: Fallback when agent needs to process PDF but lacks capability

Both can coexist — they serve different use cases.

## Tool Priority
1. pdftotext (poppler-utils) — Preferred, fastest, system-level
2. pdfplumber (Python) — Fallback if poppler not available
3. pymupdf (Python) — Alternative Python fallback

## Platform Support
- macOS: Homebrew (brew install poppler) or pip
- Linux: apt-get/dnf install poppler-utils or pip
- Windows: winget/chocolatey or pip

## Validation
All 15 skills pass: python .claude/skills/pr-review/scripts/validate_skills.py ✅
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant