Yucheng Sheng, Jiacheng Wang, Le Liang, Hao Ye, Shi Jin 7d ago

A Graph Foundation Model for Wireless Resource Allocation

A graph foundation model that uses deep learning to solve wireless resource-allocation optimization problems more efficiently than classical iterative algorithms.

Oleg Platonov, Liudmila Prokhorenkova 7d ago

Cluster Attention for Graph Machine Learning

A cluster attention mechanism for graph transformers that improves the receptive field while preserving graph-structure inductive biases.

Henry C. Conklin, Tom Hosking, Tan Yi-Chern, Julian Gold, Jonathan D. Cohen, Thomas L. Griffiths, Max Bartolo, Seraphina Goldfarb-Tarrant 7d ago

Learning is Forgetting: LLM Training As Lossy Compression

A framework that treats LLM training as lossy compression, explaining how LLMs learn by retaining task-relevant information from their training data.

Junlong Jia, Ziyang Chen, Xing Wu, Chaochen Gao, TingHao Yu, Feng Zhang, Songlin Hu 7d ago

PolicyLong: Towards On-Policy Context Extension

PolicyLong is a method for extending LLM context windows that uses on-policy data synthesis to stay aligned with the model's capabilities during training.