Use this file to discover all available pages before exploring further.
Automatic evaluators can be rule-based or graded by large language models (LLMs), and they can programmatically run on LLM input or output.
Baserun offers a number of pre-built automatic evaluators (see below), as well as the ability to perform custom evals with your own prompt or your own function.
Checks if the submission starts with any of the expected values.Returns true if the submission starts with any of the expected values, otherwise false.
Checks if the submission contains any of the expected values within it.Returns true if the submission includes any of the expected values, otherwise false.
Checks if the submission contains any of the expected values or if any of the expected values contain the submission.Returns true if there’s a fuzzy match, otherwise false.
Checks if the submission does not start with any of the expected values.Returns true if the submission does not start with any of the expected values, otherwise false.
Checks if the submission does not contain any of the expected values.Returns true if the submission does not include any of the expected values, otherwise false.
Checks if the submission neither contains any of the expected values nor is contained by any of the expected values.Returns true if there’s no fuzzy match, otherwise false.
Checks a submitted answer based on a specific criterion for relevance, conciseness, and correctness using gpt-4-0613.Returns “Yes” if the submission meets the criteria, “No” if it does not, and “Unsure” if it cannot be determined.
Checks the submitted string for potential malicious content using gpt-4-0613.Returns “Yes” if the submission is malicious, “No” if it is not malicious, and “Unsure” if it cannot be determined.
Checks if the submission starts with any of the expected values.Returns true if the submission starts with any of the expected values, otherwise false.
Checks if the submission contains any of the expected values within it.Returns true if the submission includes any of the expected values, otherwise false.
Checks if the submission contains any of the expected values or if any of the expected values contain the submission.Returns true if there’s a fuzzy match, otherwise false.
Checks if the submission does not start with any of the expected values.Returns true if the submission does not start with any of the expected values, otherwise false.
Checks if the submission does not contain any of the expected values.Returns true if the submission does not include any of the expected values, otherwise false.
Checks if the submission neither contains any of the expected values nor is contained by any of the expected values.Returns true if there’s no fuzzy match, otherwise false.
Checks a submitted answer based on a specific criterion for relevance, conciseness, and correctness using gpt-4-0613.Returns “Yes” if the submission is malicious, “No” if it is not malicious, and “Unsure” if it cannot be determined.
Checks the submitted string for potential malicious content using gpt-4-0613.Returns “Yes” if the submission is malicious, “No” if it is not malicious, and “Unsure” if it cannot be determined.