Paper List
Below is a curated list of recent and influential papers grouped by research area. This list highlights cutting-edge developments in AI and its applications.
Multimodal AI
- “Flamingo: a Visual Language Model for Few-Shot Learning” (Alayrac et al., 2022)
- “CLIP: Learning Transferable Visual Models From Natural Language Supervision” (Radford et al., 2021)
- “ImageBind: One Embedding Space To Bind Them All” (Girdhar et al., 2023)
- “BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models” (Li et al., 2023)
AI in Healthcare
- “Foundation models for generalist medical artificial intelligence” (Singhal et al., 2023)
- “Large language models encode clinical knowledge” (Singhal et al., 2022)
- “A guide to deep learning in healthcare” (Esteva et al., 2019)
Efficient Fine-tuning and Multimodal Approaches in Medical AI
- “ChatDoctor: A Medical Chat Model Fine-tuned on a Large Language Model Using Medical Domain Knowledge” (Liu et al., 2023)
- “PMC-LLaMA: Further Finetuning LLaMA on Medical Papers” (Yuan et al., 2023)
- “MedAlpaca: An Open-Source Collection of Medical Conversational AI Models and Training Data” (Chen et al., 2023)
- “MMBERT: Multimodal BERT Pretraining for Improved Medical VQA” (Kumar et al., 2022)
- “MedCLIP: Contrastive Learning from Unpaired Medical Images and Text” (Wu et al., 2023)
- “LoRA for Efficient Medical Image Classification: Less is More” (Wang et al., 2023)
Large Language Models (LLMs)
- “GPT-4 Technical Report” (OpenAI, 2023)
- “PaLM 2 Technical Report” (Anil et al., 2023)
- “Constitutional AI: Harmlessness from AI Feedback” (Bai et al., 2022)
- “Scaling Laws for Neural Language Models” (Kaplan et al., 2020)
Emerging AI Architectures
- “Mamba: Linear-Time Sequence Modeling with Selective State Spaces” (Gu et al., 2023)
- “Towards Learning Universal Hyperparameter Optimizers with Transformers” (Metz et al., 2023)
- “Scaling Vision Transformers to 22 Billion Parameters” (Dehghani et al., 2023)
Ethical AI and Bias Mitigation
- “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜” (Bender et al., 2021)
- “Decolonial AI: Decolonial Theory as Sociotechnical Foresight in Artificial Intelligence” (Mohamed et al., 2020)
- “What Does It Mean to Align AI With Human Values?” (Gabriel, 2020)
LLM Interactions and Adversarial AI
- “Red Teaming Language Models with Language Models” (Perez et al., 2022)
- “Model Inversion Attacks Against GPT-2” (Carlini et al., 2023)
- “Learning to Deceive Large Language Models” (Lin et al., 2023)
- “Scalable Oversight of AI Systems via Selective Amplification” (Christiano et al., 2023)
Prompting and Retrieval
- “How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval” (Wang et al., 2023)
- “Unsupervised Dense Information Retrieval with Contrastive Learning” (Gao & Callan, 2021)
- “Automatic Chain of Thought Prompting in Large Language Models” (Zhang et al., 2022)
- “ReAct: Synergizing reasoning and acting in Language Models” (Yao et al., 2022)
- “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks” (Lewis et al., 2020)
This list is regularly updated based on new publications and research interests of our members.
Find More Papers
- arXiv.org: A free distribution service and an open-access archive for scholarly articles.
- arXiv Sanity Lite: A web interface for browsing, searching, and visualizing arXiv papers.
- Google Scholar: A freely accessible web search engine that indexes the full text or metadata of scholarly literature.
- PubMed Central: A free full-text archive of biomedical and life sciences journal literature.
- Prompting Guide: A curated list of papers on prompting techniques and applications.
We encourage all members to explore these resources and suggest papers for discussion in our upcoming meetings.