BiribiriBird

论文笔记 DLM

LLaDA-Rec - Discrete Diffusion for Parallel Semantic ID Generation in Generative Recommendation

LLaDA Series

LLM

Happy LLM · Part1 · Transformer

古法编程手搓代码

论文笔记 DLM

TiDAR - Think in Diffusion, Talk in Autoregression

看看其他工作

论文笔记 DLM

Awesome LLaDA

整理一下人大团队的LLaDA系列

论文笔记 DLM

LLaDA-MoE ASparse MoEDiffusion Language Model

follow

论文笔记 DLM

UltraLLaDA Scaling the Context Length to 128K for Diffusion Large Language Models

follow

强化学习的数学原理 · Chap6 · Stochastic Approximation and SGD

好好学习

强化学习的数学原理 · Chap5 · Monte Carlo Learning

好好学习

强化学习的数学原理 · Chap4 · Value Iteration and Policy Iteration

好好学习

强化学习的数学原理 · Chap3 · Bellman Optimality Equation

好好学习