Skip to content

[Feature]: Index more types of documents #46

@Fahube

Description

@Fahube

Problem Statement

Few common text document types aren’t read by DocFinder.

Proposed Solution

Add support to read:

  • LibreOffice documents: odt, odp, odg
  • Microsoft Office: doc, ppt, pptx
  • Web documents: html
  • Personalised extensions for raw text: for example the user wants to index srt, log files or any other type of raw text file. The user should be able to specify which optional extension(s) he wants to add.

Alternatives Considered

To convert manually each document in pdf.

Use Cases

When the user wants to index a different type of document.

Priority

High - Critical for my use case

Contribution

  • I would be willing to implement this feature and submit a PR

Additional Context

No response

Checklist

  • I have searched existing issues to ensure this is not a duplicate
  • This feature aligns with the project's goals

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions