A Python library providing evaluation metrics to compare generated texts from LLMs, often against reference texts. Features streamlined workflows for model comparison and visualization.
python nlp machine-learning natural-language-processing text-analysis ai-evaluation large-language-models llm genai evaluation-metircs text-comparision
-
Updated
Oct 30, 2025 - Python