Skip to content
#

text-evaluation

Here are 8 public repositories matching this topic...

STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하�� Python 함수 패키지

  • Updated Jun 18, 2025
  • Python

A Streamlit web app that uses a Groq-powered LLM (Llama 3) to act as an impartial judge for evaluating and comparing two model outputs. Supports custom criteria, presets like creativity and brand tone, and returns structured scores, explanations, and a winner. Built end-to-end with Python, Groq API, and Streamlit.

  • Updated Nov 24, 2025
  • Python

🛠️ Evaluate unified models effortlessly with ULMEvalKit, your open-source toolkit for comprehensive image generation benchmarks and streamlined workflows.

  • Updated Dec 1, 2025
  • Python

Improve this page

Add a description, image, and links to the text-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more