实体
OpenReview
OpenReview
PulseAugur coverage of OpenReview — every cluster mentioning OpenReview across labs, papers, and developer communities, ranked by signal.
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
-
AI model finetuning mostly idempotent, DPO can amplify traits
A guide explores advanced techniques for post-training large language models, focusing on Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Group Relative Policy Optimization (GRPO). These methods …
-
AI solves complex inverse partial differential equations, a major math challenge
Researchers have employed artificial intelligence to solve a challenging class of mathematical problems known as inverse partial differential equations (PDEs). This AI-driven approach offers a novel method for finding s…