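on fine-tuning T2I models for alignment, but without requiring any human feedback. The key insight behind DreamSync is to leverage advances in vision-language models (VLMs), which can identify fine-grained discrepancies between a generated image and the user's input text [7, 20]. Intuitively, at a high level, our method can be thought of as a scalable version of reinforcement learning with human feedback (RLHF): just as Llama 2 [49] was iteratively refined using human feedback, DreamSync improves T2I models using feedback from VLMs, except that it does not need reinforcement learning.

Given a set of text prompts, the T2I model first generates multiple candidate images per prompt. DreamSync automatically evaluates these generated images using two VLMs. The first measures the faithfulness of a generation to the text [7, 20], while the second measures aesthetic quality [23]. The best generations are collected and used to fine-tune the T2I model with parameter-efficient LoRA fine-tuning [19]. With the newly fine-tuned T2I model, we repeat the entire process for multiple iterations: generate images, curate a new fine-tuning set, and fine-tune again.

We conduct extensive experiments using the latest benchmarks and human evaluation. We experiment with DreamSync on two T2I models, SDXL [37] and SD v1.4 [39]. Results on both models show that DreamSync enhances the align-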
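The generate, score, select, fine-tune loop described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the callables `generate`, `faith_fn`, `aes_fn`, and `finetune` are hypothetical placeholders for the T2I sampler, the two VLM scorers, and the LoRA update, and the threshold-then-argmax selection rule (`faith_min`, `aes_min`) is an assumed way of picking the "best" generation.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional, Sequence, Tuple


@dataclass
class Candidate:
    """One generated image together with its two VLM scores."""
    prompt: str
    image: Any
    faithfulness: float
    aesthetics: float


def select_best(
    prompt: str,
    images: Sequence[Any],
    faith_fn: Callable[[str, Any], float],
    aes_fn: Callable[[Any], float],
    faith_min: float,
    aes_min: float,
) -> Optional[Candidate]:
    """Score every candidate and keep the most faithful one that passes
    both thresholds (assumed selection rule). Returns None if none pass."""
    scored = [
        Candidate(prompt, img, faith_fn(prompt, img), aes_fn(img))
        for img in images
    ]
    kept = [
        c for c in scored
        if c.faithfulness >= faith_min and c.aesthetics >= aes_min
    ]
    if not kept:
        return None
    # Rank by faithfulness first, aesthetics as tie-breaker.
    return max(kept, key=lambda c: (c.faithfulness, c.aesthetics))


def run_alignment_loop(
    prompts: Sequence[str],
    generate: Callable[[Any, str, int], Sequence[Any]],
    faith_fn: Callable[[str, Any], float],
    aes_fn: Callable[[Any], float],
    finetune: Callable[[Any, List[Tuple[str, Any]]], Any],
    model: Any,
    num_iterations: int = 3,
    k: int = 4,
    faith_min: float = 0.9,
    aes_min: float = 0.6,
) -> Any:
    """Iterate: sample k images per prompt, curate the best ones,
    fine-tune on them, then repeat with the updated model."""
    for _ in range(num_iterations):
        train_set: List[Tuple[str, Any]] = []
        for prompt in prompts:
            images = generate(model, prompt, k)
            best = select_best(prompt, images, faith_fn, aes_fn,
                               faith_min, aes_min)
            if best is not None:
                train_set.append((prompt, best.image))
        if train_set:  # skip the update when nothing passed the filters
            model = finetune(model, train_set)
    return model
```

In practice `finetune` would wrap a LoRA training step and the scorers would call VLMs; passing them in as plain callables keeps the control flow of the loop separate from any particular model library.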
ICLR 2025  Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment. Dongping Chen, Ruoxi Chen, Shu Pu, Zhaoyi Liu, Yanru Wu, Caixi Chen, Benlin Liu, Yue Huang, Yao Wan, Pan Zhou, Ranjay Krishna. International Conference on Learning Representations, 2025.

ICLR 2025  AHA: A Vision-Language-Model for Detecting and Reasoning over Failures in Robotic Manipulation.

Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows. Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer. ACM Transactions on Computer-Human Interaction.

NeurIPS 2024  Task Me Anything. Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna. Advances in Neural Information Processing Systems, 2024.

NeurIPS 2024  Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models. Yushi Hu*, Weijia Shi*, Xingyu Fu, Dan Roth, Mari Ostendorf, Luke Zettlemoyer, Noah A Smith*, Ranjay Krishna*. Advances in Neural Information Processing Systems, 2024.

NeurIPS 2024  Multilingual Diversity Improves Vision-Language Representations. Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna*. Advances in Neural Information Processing Systems, 2024. Spotlight Paper award (awarded to top 5%).

NeurIPS 2024  The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better. Scott Geng, Cheng-Yu Hsieh, Vivek Ramanujan, Matthew Wallingford, Chun-Liang Li, Pang Wei Koh*, Ranjay Krishna*. Advances in Neural Information Processing Systems, 2024.

NeurIPS 2024  ActionAtlas: A VideoQA Benchmark for Fine-grained Action Recognition. Mohammadreza Salehi, Jae Sung Park, Aditya Kusupati, Ranjay Krishna, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi. Advances in Neural Information Processing Systems, 2024.

NeurIPS 2024  NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples. Wenxuan Peng, Baiqi Li, Zhiqiu Lin, Jean de Dieu Nyandwi, Zixian Ma, Simran Khanuja, Deva Ramanan, Ranjay Krishna, Graham Neubig. Advances in Neural Information Processing Systems, 2024.

NeurIPS 2024  Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass. Ethan Shen, Alan Fan, Sarah M Pratt, Jae Sung Park, Matthew Wallingford, Sham M Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati. Advances in Neural Information Processing Systems, 2024.