# PDF Extractor
A simple tool for extracting images and text from PDF documents. This project allows users to easily retrieve content from their PDFs for further processing or analysis.
## Features
- Extracts text content from PDF files
- Extracts images from PDF files
- Supports multiple PDF formats
- Easy to use command-line interface
## Requirements
- Python 3.x
- php
- jquery
- Required libraries:
- `PyMuPDF`
You can install the required libraries using pip:
```bash
pip install PyMuPDF-
Clone the repository:
git clone https://github.com/a2-nabil/PDF-Extractor-by-Nabil.git cd pdf-extractor -
Run the extractor:
python pdf_extractor.py <path_to_pdf>
-
Extracted content will be saved in the
outputdirectory.
This command will extract all text and images from example.pdf and save them in the output directory.
Contributions are welcome! If you have suggestions or improvements, feel free to open an issue or submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.
- PyMuPDF for converting PDF pages to images.
For questions or feedback, please contact me at [a2nabil000@gmail.com].