Talk Title: Investigating the Evolution of Evaluation from Model Training to GenAl inference