Improving Large Language Model by Systematically Improving its Data

Labeled data powers AI/ML in the enterprise, but real-world datasets have been found to contain between 7% and 50% annotation errors. Imperfectly labeled text data hampers the training (and evaluation) of ML models across tasks like intent recognition, entity recognition, and sequence generation. Although pretrained LLMs are equipped with a lot of world knowledge, their performance is adversely affected by noisy training data (as noted by OpenAI). In this talk, we illustrate data-centric techniques to mitigate the effect of label noise without changing any code related to model architecture, hyperparameters, or training. These data quality improvement techniques should thus remain applicable even for future advanced LLMs like GPT-10.

Resources for this session:
Slides: https://docs.google.com/presentation/...
Other resources:
https://cleanlab.ai/blog/fine-tune-LLM/
https://colab.research.google.com/git...
https://docs.cleanlab.ai/stable/index...

[eventID:21923]
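
To make the data-centric idea in the description concrete, here is a minimal sketch of one way to flag likely annotation errors using only a classifier's predicted probabilities, leaving the model code untouched. It is a simplified heuristic in the spirit of confident learning, not the method shown in the talk and not the cleanlab API. The function name `flag_label_issues` and the toy data are hypothetical, and the probabilities are assumed to be out-of-sample (for example, from cross-validation).

```python
from collections import defaultdict


def flag_label_issues(labels, pred_probs):
    """Return indices of examples whose given label looks suspicious.

    labels: list of int class ids (the possibly noisy annotations).
    pred_probs: list of per-class probability lists, one per example.
    Hypothetical helper for illustration only.
    """
    # Per-class threshold: the mean probability the model assigns to
    # class k over the examples that are annotated as class k.
    sums = defaultdict(float)
    counts = defaultdict(int)
    for y, p in zip(labels, pred_probs):
        sums[y] += p[y]
        counts[y] += 1
    thresholds = {k: sums[k] / counts[k] for k in counts}

    issues = []
    for i, (y, p) in enumerate(zip(labels, pred_probs)):
        # Classes the model is confident about for this example.
        confident = [k for k in thresholds if p[k] >= thresholds[k]]
        # Flag the example if the model is confident in some class,
        # but not in the annotated one.
        if confident and y not in confident:
            issues.append((p[y], i))

    # Rank flagged examples so the least plausible labels come first.
    return [i for _, i in sorted(issues)]


# Toy data (made up): examples 2 and 5 carry labels the model disagrees with.
labels = [0, 0, 0, 1, 1, 1]
pred_probs = [
    [0.90, 0.10],
    [0.80, 0.20],
    [0.10, 0.90],
    [0.20, 0.80],
    [0.10, 0.90],
    [0.85, 0.15],
]

suspect = flag_label_issues(labels, pred_probs)
# Drop the flagged examples before fine-tuning; relabeling them is the
# other common option.
cleaned = [(y, p) for i, (y, p) in enumerate(zip(labels, pred_probs))
           if i not in set(suspect)]
```

On this toy data the flagged indices are 2 and 5, leaving four examples in `cleaned`. In practice the flagged examples would be reviewed, relabeled, or filtered, and the same model would then be retrained on the corrected data.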

Microsoft Reactor
