Переходьте в офлайн за допомогою програми Player FM !
Подкасти, які варто послухати
РЕКЛАМА


[QA] How does Transformer Learn Implicit Reasoning?
Manage episode 485766258 series 3524393
This paper explores implicit multi-hop reasoning in large language models, revealing a developmental trajectory and introducing diagnostic tools to enhance interpretability and understanding of reasoning processes.
https://arxiv.org/abs//2505.23653
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2439 епізодів
Manage episode 485766258 series 3524393
This paper explores implicit multi-hop reasoning in large language models, revealing a developmental trajectory and introducing diagnostic tools to enhance interpretability and understanding of reasoning processes.
https://arxiv.org/abs//2505.23653
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2439 епізодів
Alle episoder
×
1 [QA] Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts 9:37

1 Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts 57:59

1 [QA] Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination 8:49

1 Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination 22:17

1 [QA] Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation 7:58

1 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation 27:15



1 [QA] Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful 7:03

1 Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful 18:57


1 [QA] Real-TabPFN: Improving Tabular Foundation Models via Continued Pre-training With Real-World Data 7:28

1 Real-TabPFN: Improving Tabular Foundation Models via Continued Pre-training With Real-World Data 10:15



1 [QA] Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 7:21

1 Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 15:33

1 [QA] Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling? 8:16

1 Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling? 16:52




Ласкаво просимо до Player FM!
Player FM сканує Інтернет для отримання високоякісних подкастів, щоб ви могли насолоджуватися ними зараз. Це найкращий додаток для подкастів, який працює на Android, iPhone і веб-сторінці. Реєстрація для синхронізації підписок між пристроями.