Reply evaluation and monitoring results
View reply evaluation results
AI → Configuration → Reply Evaluation

Scoring metrics
The MaiAgent platform provides reply evaluation functionality, recording and automatically scoring each Q&A; scores include
Faithfulness score
Whether the LLM answered truthfully rather than fabricating an answer
LLM, RAG, Knowledge
✅
✅
Answer Relevance score
Whether the LLM addressed the key points, and whether the answer is incomplete or contains redundant text
LLM, RAG, Knowledge
✅
✅
Context Precision score
Whether the content retrieved by RAG is relevant to the question
RAG, Knowledge
✅
✅
Answer Correctness score
The correctness of the reply compared to the correct answer
LLM, RAG, Knowledge
✅
✅
Answer Similarity score
The semantic similarity between the reply and the correct answer
LLM, RAG, Knowledge
✅
✅
Context Recall score
Whether the content retrieved by RAG included the information compared to the correct answer
RAG, Knowledge
✅
✅

Clarifying causes of low scores and ways to resolve them
LLM capability issues: unable to answer based on reference materials
Solution: switch to a more capable LLM
RAG retrieval capability: whether it found data related to the question
Solution: contact MaiAgent support
Whether the knowledge base provides sufficient data
Solution: supplement the knowledge base with correct data and FAQ common questions
Last updated
Was this helpful?