[Paper Reading Notes] Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
2024-11-24
Proposes DUET (Dual-scale Graph Transformer), a method that combines global action planning with fine-grained cross-modal understanding.
1428 words
|
7 minutes
Scaling Data Generation in Vision-and-Language Navigation
2024-11-22
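The authors propose ScaleVLN, a data generation paradigm for VLN, and show through comprehensive experiments that building high-quality navigation graphs and using camera-quality images are both effective.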
472 words
|
2 minutes
[Paper Reading Notes] Building Rome in a Day
2024-11-02
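An improvement on the Photo Tourism paper. Its main contribution is a high-performance computing platform that can reconstruct an entire city within tens of hours from hundreds of thousands of unrelated, unordered images scraped from the internet.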
757 words
|
4 minutes
[Paper Reading Notes] DirectGPT: A Direct Manipulation Interface to Interact with Large Language Models
2024-09-22
A user interface for interacting with LLMs, built on the principles of direct manipulation.
443 words
|
2 minutes
[Paper Reading Notes] CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator
2024-09-11
An LLM-powered programming assistant.
1243 words
|
6 minutes
[Paper Reading Notes] CodeHelp: Using Large Language Models with Guardrails for Scalable Support in Programming Classes
2024-09-11
An LLM assistant with guardrails that helps solve programming problems.
577 words
|
3 minutes
[Paper Reading Notes] Teach AI How to Code: Using Large Language Models as Teachable Agents for Programming Education
2024-09-11
The user takes the role of a teacher and teaches the AI to program, an approach known as LBT (learning by teaching).
1772 words
|
9 minutes
[Paper Reading Notes] CodeTailor: LLM-Powered Personalized Parsons Puzzles for Engaging Support While Learning Programming
2024-09-10
1864 words
|
9 minutes
