it is just an integer, so it is easy to verify and not very subjective (although, as you will see in future lessons, there can still be some subjectivity even with this value)
it does need some reasoning, so it is a good test of an LLM’s ability to reason when it is extracting values
which symptom is the earliest one?
make sure the earliest symptom onset date isn’t indirectly mentioned already in the clinical narrative
if it is not mentioned, figure out the last day of the given month
calculate the difference to make a best-guess estimate of the upper limit
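
The last two steps are plain date arithmetic and can be sketched in Python. This is a minimal sketch under stated assumptions, not the lesson's own implementation: the function names are hypothetical, `reference_date` stands in for whatever date the difference is measured from (this section does not name it), and the result is assumed to be counted in whole days with the onset falling on or after the reference date.

```python
import calendar
from datetime import date


def last_day_of_month(year: int, month: int) -> date:
    """Return the last calendar day of the given month."""
    # calendar.monthrange returns (weekday of the 1st, number of days in month)
    days_in_month = calendar.monthrange(year, month)[1]
    return date(year, month, days_in_month)


def upper_limit_days(reference_date: date, onset_year: int, onset_month: int) -> int:
    """Best-guess upper limit, in days, when only the onset month is known.

    Assumption: the latest possible onset is the last day of the stated
    month, and the difference is measured from a hypothetical reference_date.
    """
    latest_onset = last_day_of_month(onset_year, onset_month)
    return (latest_onset - reference_date).days


# Hypothetical example: reference date 2021-02-10, onset known only as "February 2021"
print(upper_limit_days(date(2021, 2, 10), 2021, 2))
```

Using `calendar.monthrange` avoids hard-coding month lengths and handles leap years, which is where a model doing this arithmetic in its head is most likely to slip.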