Warning: mkdir(): No space left on device in /var/www/hottg/post.php on line 59

Warning: file_put_contents(aCache/aDaily/2025-07-20/post/opendatascience/-2282-2283-): Failed to open stream: No such file or directory in /var/www/hottg/post.php on line 72
коллеги из университета Циньхуа выпустили работу под названием Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? (А точно ли обучение с подкреплением расширяет мыслительные возможности моделей?) @Data Science by ODS.ai 🦜

TG Telegram Group & Channel

Data Science by ODS.ai 🦜 | United States America (US)

Create: 2025-05-01 Update: 2025-07-20 18:36:32

коллеги из университета Циньхуа выпустили работу под названием Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? (А точно ли обучение с подкреплением расширяет мыслительные возможности моделей?)

в ней они приходят к выводу, что нет, базовая модель остается лучше на длинной дистанции; я высказывал такого рода сомнение еще про Qwen, но тут уже полноценное подтверждение; отдельно хочу выразить восхищение визуальным оформлением результатов, очень доходчиво

Data Science by ODS.ai 🦜

Forwarded from Valuable AI / Валентин Малых

This media is not supported in your browser

VIEW IN TELEGRAM

This media is not supported in your browser

VIEW IN TELEGRAM

коллеги из университета Циньхуа выпустили работу под названием Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? (А точно ли обучение с подкреплением расширяет мыслительные возможности моделей?)

в ней они приходят к выводу, что нет, базовая модель остается лучше на длинной дистанции; я высказывал такого рода сомнение еще про Qwen, но тут уже полноценное подтверждение; отдельно хочу выразить восхищение визуальным оформлением результатов, очень доходчиво

👍12

hottg.com/opendatascience/2282

4.02K viewsMay 1 at 11:50

>>Click here to continue<<

Data Science by ODS.ai 🦜

Share with your best friend

Telegram Desktop App Not Working on Windows?

Run Telegram in Compatibility Mode

The fix won't work if the app was functioning normally a few days ago and you haven't made drastic changes to the OS or updated Windows. If the issue appears after a Windows update, or you're running an older version of Telegram, it's possible the current version of the app isn't compatible with your operating system. Therefore, update your app to the most recent and compatible version or roll back your Windows update.If you can't do either, run the app in compatibility mode. You might not have used it before, but it is one of the handy hidden modes in Windows.However, before doing that, run the compatibility troubleshooter that may resolve the issue right away. To run the troubleshooter, right-click on Telegram's shortcut and go to Properties. Navigate to the Compatibility tab in the Telegram Properties window and click Run compatibility troubleshooter.If the troubleshooter doesn't identify the issue, manually adjust compatibility settings. To do so, follow these steps: Check the box for Run this problem in compatibility mode for: and select Windows 8 from the dropdown menu. Hit OK after clicking Apply.

коллеги из университета Циньхуа выпустили работу под названием Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? (А точно ли обучение с подкреплением расширяет мыслительные возможности моделей?)

Data Science by ODS.ai 🦜 TG
Webview: 2282
Telegram TG Webview: hottg.com/opendatascience/webview
Telegram TG Channel: Data Science by ODS.ai 🦜
Telegram Updated:
Warning: filemtime(): stat failed for aCache/aDaily/2025-07-20/post/opendatascience/-2282-2283- in /var/www/hottg/post.php on line 338
1970-01-01 00:00:00

United States America Popular Telegram Group (US)

Warning: Undefined array key 3 in /var/www/hottg/function.php on line 115

Fatal error: Uncaught mysqli_sql_exception: Can't create/write to file '/tmp/#sql-temptable-a06e-5d8a55-2f69.MAI' (Errcode: 28 "No space left on device") in /var/www/hottg/function.php:216 Stack trace: #0 /var/www/hottg/function.php(216): mysqli_query() #1 /var/www/hottg/function.php(115): select() #2 /var/www/hottg/post.php(351): daCache() #3 /var/www/hottg/route.php(63): include_once('...') #4 {main} thrown in /var/www/hottg/function.php on line 216