A
AptSelect
Dev Tools
Описание
A local LLM client for parallel testing and evaluation — I built AptSelect to stop writing throwaway scripts every time I needed to test how different LLMs handle specific instructions and prompt edge cases.What it does:Parallel Execution: Send a single prompt to OpenAI, Anthropic, Mistral, and Gemini simultaneously. Compare the outputs, latency, and exact token usage side-by-side.Batch Evaluations: Upload a CSV dataset to run bulk tests across multiple models at once.Manual Diagnostics: Grade outputs manually (Pass/Fail) and assign diagnostic tags (e.g., Hallucination, Format Error) to b
Теги
#llm#devtools
По данным Hacker News · Перевод сгенерирован автоматически