Description
- Evaluating specialized LLMs using Weave
- Productionizing the latest LLM-as-a-judge research
- Improving on your existing judge
- Building annotation UIs
Featured places
See more information in Google Maps
Transcript
Follow along using the transcript.
Show transcript
Microsoft Reactor
106K subscribers
Transcript
NaN / NaN
Show more