Переходьте в офлайн за допомогою програми Player FM !
Подкасти, які варто послухати
РЕКЛАМА
Orca 2: Enhancing Reasoning in Smaller Language Models - Technical Details
Manage episode 421181730 series 3474159
This story was originally published on HackerNoon at: https://hackernoon.com/orca-2-enhancing-reasoning-in-smaller-language-models-technical-details.
Orca 2 enhances small language models' reasoning by teaching diverse strategies for tasks, outperforming models up to 10x larger in complex benchmarks.
Check more stories related to programming at: https://hackernoon.com/c/programming. You can also check exclusive content about #language-models, #orca-2, #reasoning-techniques, #machine-learning, #small-models, #imitation-learning, #ai-benchmarks, #model-training, and more.
This story was written by: @textmodels. Learn more about this writer by checking @textmodels's about page, and for more stories, please visit hackernoon.com.
The Orca 2 dataset has four main sources:FLAN: Our main source of prompts for synthetic data generation is the FLAN-v2 Collection 33, which consists of five sub-collections. Following Orca 1 42, we consider tasks from only CoT, NiV2, T0, Flan 2021 and Dialogue. Some of the tasks are associated with an associated answer. For the Cautious Reasoning dataset we selected ~602 zero-shot user queries from the split of 1448 high quality tasks out of 1913.
346 епізодів
Manage episode 421181730 series 3474159
This story was originally published on HackerNoon at: https://hackernoon.com/orca-2-enhancing-reasoning-in-smaller-language-models-technical-details.
Orca 2 enhances small language models' reasoning by teaching diverse strategies for tasks, outperforming models up to 10x larger in complex benchmarks.
Check more stories related to programming at: https://hackernoon.com/c/programming. You can also check exclusive content about #language-models, #orca-2, #reasoning-techniques, #machine-learning, #small-models, #imitation-learning, #ai-benchmarks, #model-training, and more.
This story was written by: @textmodels. Learn more about this writer by checking @textmodels's about page, and for more stories, please visit hackernoon.com.
The Orca 2 dataset has four main sources:FLAN: Our main source of prompts for synthetic data generation is the FLAN-v2 Collection 33, which consists of five sub-collections. Following Orca 1 42, we consider tasks from only CoT, NiV2, T0, Flan 2021 and Dialogue. Some of the tasks are associated with an associated answer. For the Cautious Reasoning dataset we selected ~602 zero-shot user queries from the split of 1448 high quality tasks out of 1913.
346 епізодів
Усі епізоди
×


1 Step-by-Step Guide to Publishing Your First Python Package on PyPI Using Poetry: Lessons Learned 4:05












1 AOSP and Linux Cross Border Convergence! Look at OpenFDE, New Open Source Linux Desktop Environment 3:16




1 Is Your Reporting Software WCAG Compliant? Make Data Accessible to Everyone with Practical Steps 14:36











1 TypeScript SDK Development: A 5-Year-Old Could Follow This Step-By-Step ~ Part 1: Our First MVP 4:15




1 Load Balancing For High Performance Computing
Using Quantum Annealing: Grid Based Application 12:00

1 Load Balancing For High Performance Computing
Using Quantum Annealing: Adaptive Mesh Refinement 4:57















1 How to Create Scrollable Lists with Protocol-Oriented Design & UICollectionViewCompositionalLayout 4:03




























































1 An Interview With Carl Cervone: On Open Source, Digital Public Goods Funding, and Impact Tracking 12:08



1 Empowering Newbies: Building Confidence Through 600+ LeetCode Solutions – A Guide for Beginners 9:20
















































1 274 Stories To Learn About Software 1:11:53



1 364 Stories To Learn About Html 1:34:00







1 The First 100: Proven Tactics From Stripe, Zapier & Convertkit That Get You the Users You Want 7:19

1 How to Use Versatile Data Kit to Turn Your Jupyter Notebooks Into Scalable & Reliable Data Pipelines 9:03



1 342 Stories To Learn About Software Architecture 1:21:03




1 535 Stories To Learn About Python 2:23:30

1 279 Stories To Learn About Programming 1:16:55


1 411 Stories To Learn About Nodejs 1:49:34


1 475 Stories To Learn About Mobile App Development 1:56:12




1 419 Stories To Learn About Kubernetes 1:44:49


1 334 Stories To Learn About Javascript Development 1:09:43

1 506 Stories To Learn About Java 2:03:09




1 341 Stories To Learn About Testing 1:20:08



Ласкаво просимо до Player FM!
Player FM сканує Інтернет для отримання високоякісних подкастів, щоб ви могли насолоджуватися ними зараз. Це найкращий додаток для подкастів, який працює на Android, iPhone і веб-сторінці. Реєстрація для синхронізації підписок між пристроями.