Rating Criteria

Temporal Soundness

score10%llm

Description

Evaluates whether the sources will contain the correct information at the time of resolution. Uses LLM semantic analysis to assess temporal alignment.

How It Works

An LLM evaluates each source's likelihood of having the required data:

  1. 1.Analyzes URL structure for temporal indicators
  2. 2.Assesses if the source is likely to be updated with relevant data
  3. 3.Checks if the prompt's time context matches the source
  4. 4.Scores each source from 0-100

Scoring Thresholds

ScoreAssessment
80-100Sources appropriately encode the relevant time window
60-79Sources may contain information but timing is uncertain
0-59Sources unlikely to have information at the right time

What Is Evaluated

  • >Does the URL include date/time parameters?
  • >Is this a results page that will be populated?
  • >Does the source update in real-time or is it archived?
  • >Will historical data still be accessible?