Blog Archive
Other
- April 2025 - DeepSeek-GRM: Generalist Reward Modeling
- April 2025 - Pieter Abbeel Keynote at GTC2025: AI for Humanoid Robots
- March 2025 - Tracing the thoughts of LLM (by Anthropic)
- March 2025 - Deepseek0V3-0324 Gemini2.5-pro GPT-4o
- March 2025 - Think-Then-React TTR
- March 2025 - Why Cant AI Make Its Own Discoveries? — With Yann LeCun
- March 2025 - An opinion about MCP
- March 2025 - Jensen Huang keynotes at Nvidia GTC 2025
- March 2025 - An interview with Sam Altman - building a consumer tech company
- March 2025 - Scaling Laws for DiLoCo by Google
- March 2025 - Transformers without Normalization by DyT (dynamic Tanh)
- March 2025 - MCP explained
- March 2025 - Google open-source Gemma 3
- March 2025 - ULTRA-SPARSE MEMORY NETWORK
- March 2025 - Hard Fork interview Dario Amodei
- March 2025 - The Danger of Overthinking
- March 2025 - Huggingface ultrascale playbook
- March 2025 - Deepseek open-source-week summary
- February 2025 - Snowflakes CEO Sridhar Ramaswamy interview
- February 2025 - Deepseek-R2 outlook
- February 2025 - Claude 3.7 Sonnet
- February 2025 - FigureAI’ Helix - VLA model
- February 2025 - Grok 3 and Grok 3-mini
- February 2025 - Jeff Dean and Noam Shazeer interview - Google AGI outlook
- February 2025 - DeepSeek Native Sparse Attention (NSA)
- February 2025 - Sam Altman: GPT-4.5 andGPT5
- February 2025 - GPT-4.5 and GPT 5 outlook
- February 2025 - Sam Altman: three observations
- February 2025 - Qwen2.5-Max model
- February 2025 - Big idea 2025
- January 2025 - DeepSeek Janus Pro model
- January 2025 - Demis Hassabis interview - reach AGI in 3-5 years
- January 2025 - Interview with DeepSeek founder Liang Wenfeng
- January 2025 - OpenAI Operator launch
- January 2025 - Minimax-Text-01 and Minimax-VL-01
- January 2025 - DeepSeek-R1 reasoning model
- January 2025 - Google’s Titans vs Transformers
- January 2025 - The unbearable slowness of being
- January 2025 - AI trend 2025 [A]
- January 2025 - How difficult is AI alignment
- January 2025 - Why RLHF is not True RL - Atlas Wang
- January 2025 - Nvidia Cosmos: WFM platform
- January 2025 - Rich Sutton DAI 2024 Speech
- January 2025 - About LLM scaling law - Jason Wei @OpenAI
- December 2024 - DeepSeek V3 Tech Report
- December 2024 - Google Deep Research
- December 2024 - AI trend 2025 - Rob Toews from Radical Ventures
- December 2024 - OpenAI o3 model for reasoning
- December 2024 - OpenAI o3 model
- December 2024 - Sequoia: AI in 2025
- December 2024 - OpenAI o3 model - Arc AGI
- December 2024 - Ilya Sutskever: pre-training is over
- December 2024 - OpenAI o1 pro architecture
- December 2024 - The rise of small language model (SLMs)
- December 2024 - Google Gemini 2.0 Flash
- December 2024 - Hinton Nobel Prize speech
- December 2024 - Rich Roll and Yuval Noah Harari interview: AI vs human
- December 2024 - Mamba vs Transformers
- December 2024 - I-JEPA: A Human-Like world Model by Yann Lecun
- December 2024 - World Labs: 3D world generation model
- December 2024 - NeurIPS best paper award (Tian et.al.)
- December 2024 - Reward hacking in RLHF
- December 2024 - Claude MCP
- November 2024 - Jeff Dean ‘s reaction to AlphaChip
- November 2024 - Yann LeCun’s interview with Nikhil Kamath
- November 2024 - DeepMind AI report: A new golden age of discovery
- November 2024 - SoT: Can Stories Help LLMs Reason? Curating Information Space Through Narrative
- November 2024 - Richard Sutton: reinforcement learning and continuous training
- November 2024 - Andrew Ng: The Rise Of AI Agents And Agentic Reasoning
- November 2024 - Can Test-Time-Training (TTT) fix the Scaling law ceiling
- November 2024 - Mustafa Suleyman vs Reid Hoffman @Masters of Scale Summit: distillation, AI agents, redefine hallucination
- November 2024 - Q-star 2.0 unlocks new scaling law
- November 2024 - Anthropic 5-hour interview by Lex Fridman
- November 2024 - Kevin Weil (OpenAI) vs Mike Kreiger (Anthropic): a CPO conversation
- November 2024 - How Far is Video Generation from World Model: A Physical Law Perspective
- November 2024 - Sam Altman’s interview by Harry Stebbings
- November 2024 - Ben Horowitz speecg @FII Summit
- November 2024 - Yann LeCun: LLM can never reach AGI
- October 2024 - AI in the next 10 years
- October 2024 - Apple paper: GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
- October 2024 - LightRAG:cost-performant GraphRAG
- October 2024 - Terence Tao: About AI
- October 2024 - Domain specific AI agents (DXA) will shape the industrial world in the next 10 years
- October 2024 - Nature: LLM scaling law
- October 2024 - OpenAI Canvas
- October 2024 - OpenAI DevDay 2024
- October 2024 - Liquid-40B (MIT)
- September 2024 - Feifei Li vs Justin Johnson: unveils the next frontier of AI - LWM
- September 2024 - OpenAI o1: post-training scaling law
- September 2024 - Andrew Lo (MIT): LLM shaping Finance
- September 2024 - Jeff Dean: Google Gemini
- September 2024 - Li Mu speech@SHJU: LLM present and future
- August 2024 - Andew Ng interview with Ark Invest on Agentic Workflow
- August 2024 - DeepMind Gemma-2-2B model
- July 2024 - Agentic RAG makes chatting with docs smarter
- July 2024 - Self-Taught Reasoning (STaR) powers LLM
- July 2024 - Jerry Liu: Agentic RAG
- July 2024 - About AI Agent (2024)
- July 2024 - OpenAI: CriticGPT
- May 2024 - Feifei Li@TED: Spatial Intelligence and LWM
- April 2024 - Agentic reasoning - Andrew Ng
- February 2024 - Geoffrey Hinton vs Feifei Li: Responsible AI
- February 2024 - Jeff Dean: trends of ML
- February 2024 - Andrew Ng’s interview on AI’s potential effect @WSJ
- January 2024 - Operator - OpenAI agent
- January 2024 - Neda: program synthesis
- January 2024 - AI in 2024
- December 2023 - Weak to strong generalization
- September 2023 - LLM based agents: a survey
- July 2023 - LLM powered autonomous agents - by Lilian Weng
- June 2023 - Emily Chang interview Mira Murati at OpenAI
- April 2023 - Lu Qi speech on LLM