“What’s wrong with LLMs and what we should be building instead” - Tom Dietterich - #VSCF2023

Thomas G. Dietterich is emeritus professor of computer science at Oregon State University. He is one of the pioneers of the field of machine learning. He served as executive editor of the journal called Machine Learning (1992–98) and helped co-found the Journal of Machine Learning Research. He is one of the members of our select valgrAI Scientific Council. Keynote: “What’s wrong with LLMs and what we should be building instead” Abstract: Large Language Models provide a pre-trained foundation for training many interesting AI systems. However, they have many shortcomings. They are expensive to train and to update, their non-linguistic knowledge is poor, they make false and self-contradictory statements, and these statements can be socially and ethically inappropriate. This talk will review these shortcomdifferentings and current efforts to address them within the existing LLM framework. It will then argue for a , more modular architecture that decomposes the functions of existing LLMs and adds several additional components. We believe this alternative can address all of the shortcomings of LLMs. We will speculate about how this modular architecture could be built through a combination of machine learning and engineering. Timeline: 00:00-02:00 - Introducción 00:00-02:00 Introduction to large language models and their capabilities 02:01-3:14 Problems with large language models: Incorrect and contradictory answers 03:15-4:28 Problems with large language models: Dangerous and socially unacceptable answers 04:29-6:40 Problems with large language models: Expensive to train and lack of updateability 06:41-12:58 Problems with large language models: Lack of attribution and poor non-linguistic knowledge 12:59-15:02 Benefits and limitations of retrieval augmentation 15:03-15:59 Challenges of attribution and data poisoning 16:00-18:00 Strategies to improve consistency in model answers 18:01-21:00 Reducing dangerous and socially inappropriate outputs 21:01-25:26 Learning and applying non-linguistic knowledge 25:27-37:35 Building modular systems to integrate reasoning and planning 37:36-39:20 Large language models have surprising capabilities but lack knowledge bases. 39:21-40:47 Building modular systems that separate linguistic skill from world knowledge is important. 40:48-45:47 Questions and discussions on cognitive architectures and addressing the issue of miscalibration. 45:48 Overcoming flaws in large language models through prompting engineering and verification. Follow us! LinkedIn: Instagram: Youtube: @valgrai/ Twitter:
Back to Top