Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning - Enggist