Skip to content

a2-nabil/PDF-Extractor-by-Nabil

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

# PDF Extractor

A simple tool for extracting images and text from PDF documents. This project allows users to easily retrieve content from their PDFs for further processing or analysis.

## Features

- Extracts text content from PDF files
- Extracts images from PDF files
- Supports multiple PDF formats
- Easy to use command-line interface

## Requirements

- Python 3.x
- php
- jquery
- Required libraries:
  - `PyMuPDF`

You can install the required libraries using pip:

```bash
pip install PyMuPDF

Usage

  1. Clone the repository:

    git clone https://github.com/a2-nabil/PDF-Extractor-by-Nabil.git
    cd pdf-extractor
  2. Run the extractor:

    python pdf_extractor.py <path_to_pdf>
  3. Extracted content will be saved in the output directory.

Example

This command will extract all text and images from example.pdf and save them in the output directory.

Contributing

Contributions are welcome! If you have suggestions or improvements, feel free to open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

  • PyMuPDF for converting PDF pages to images.

Contact

For questions or feedback, please contact me at [a2nabil000@gmail.com].

About

a pdf extractor . It can extract images and texts from your pdf.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors