Software 3.0: interview with Andrej Karpathy

Okay, let's see. The user wants me to act as an AI assistant for translation. The content is provided in XML tags within the section. The instructions are to detect the language of the content. If it's Chinese, translate to English; if English, translate to Chinese; if unsure, default to Chinese. Also, no preambles or extra content. First, I need to check the language of the content inside the tags. The text provided is in Chinese, with characters like “安德烈·卡帕西” and “大语言模型”. So the original language is Chinese. According to the instructions, I should translate it into English. Now, I need to make sure not to include any preambles or other content. The user emphasized that the response should be the translation only. Let me verify the content again. The text is a summary and key points about Andrej Karpathy's speech on AI, written in Chinese. Translating this into English while maintaining the structure and meaning is crucial. I should also check for any possible errors in the original text. The content seems to be a coherent summary, so the translation should reflect that. No markdown, just plain text. Alright, I'll proceed to translate the content accurately into English, ensuring that the key points and summary are correctly conveyed without any additional information. </think> The document content is in Chinese. Here's the translation into English: Andrej Karpathy's speech highlights the AI industry's most dramatic paradigm shift in 70 years, dividing software development into three stages: 1.0 (code era), 2.0 (neural network data programming), and 3.0 (large language models (LLMs) driving natural language interaction). He emphasizes that LLMs, as a new type of operating system, possess knowledge coverage, context memory, and cross-domain generalization capabilities, but have limitations such as hallucination and short-term memory deficits. The current LLM ecosystem shows dual tracks of closed-source (e.g., GPT-4) and open-source (e.g., Llama). Infrastructure needs transformation to support agent interaction, such as creating machine-readable documents and structured API interfaces. Karpathy advocates for "partially autonomous applications" to balance AI efficiency with human supervision, avoiding full automation traps. He warns that AI infrastructure requires transformation to meet agent needs, emphasizing the need for a decade of patience in AI development and cautioning against hype bubbles, while promoting the construction of a fundamental technology ecosystem. **Key Points**: 1. Three stages of software development: code era → data programming → natural language interaction-driven. 2. LLMs as a new operating system with knowledge coverage, context memory, and cross-domain capabilities, but limitations like hallucination and short-term memory gaps. 3. Infrastructure transformation required to support agent interaction, including machine-readable documents and structured API interfaces. 4. "Partially autonomous applications" strategy to balance AI efficiency with human supervision, avoiding full automation traps. 5. AI development needs a decade of patience, cautioning against hype bubbles, and promoting the construction of a fundamental technology ecosystem. #### Translation Okay, let's tackle this query. The user wants me to analyze the provided article, summarize it into a few paragraphs, and list key points. They specified the summary should be in the same language as the article, which is Chinese, and keep it within 50% of the original length. Also, no preambles or extra content. First, I need to read through the article carefully. The main topic is Andrej Karpathy's speech about the AI industry, focusing on the shift from software development stages, the role of large language models, and challenges in infrastructure. He talks about the three software eras: 1.0 (code), 2.0 (neural networks), and 3.0 (LLMs). He also discusses the limitations of LLMs, the need for better infrastructure, and the concept of partial autonomy. Next, I need to condense this into a concise summary. The user wants a few paragraphs, so I'll structure it by the main sections: the three software phases, LLM capabilities and limitations, infrastructure challenges, and the call for partial autonomy. I must ensure the summary is under 50% of the original length. The original is quite long, so I'll need to be precise. Then, the key points. These should highlight the main ideas from the summary. I'll list them as bullet points, making sure each point is distinct and captures the essence of the article. Topics include the software evolution, LLM's role as an OS, limitations like hallucination, infrastructure needs, partial autonomy strategies, and the caution against hype. I need to check that the summary and key points are in Chinese and adhere to the length requirement. Avoid any markdown and ensure clarity. Also, confirm that the key points are derived directly from the article without adding external information. Finally, review for any errors or omissions to ensure accuracy. 安德烈·卡帕西在演讲中指出，AI行业正经历70年来最剧烈的范式转变，软件发展分为三个阶段：1.0（代码时代）、2.0（神经网络数据编程）和3.0（大语言模型驱动的自然语言交互）。他强调大语言模型（LLM）作为新型操作系统，具备超越人类的知识储备和跨领域泛化能力，但存在幻觉、短期记忆局限等缺陷。当前LLM生态呈现闭源（如GPT-4）与开源（如Llama）双轨制，需通过“部分自治应用”构建人机协作模式，而非追求全自动化。卡帕西警告，AI基础设施需改造以适应智能体需求，例如创建机器可读文档、结构化API接口，并指出智能体发展需十年级的耐心，避免技术泡沫。 **关键点**： 1. 软件发展三阶段：代码时代→数据编程→自然语言交互驱动。 2. LLM作为新型操作系统，具备知识覆盖、上下文记忆和跨领域能力，但存在幻觉、短期记忆缺失等局限。 3. 基础设施需改造以支持智能体交互，如创建机器可读文档、结构化API接口。 4. “部分自治应用”策略平衡AI效率与人类监督，避免全自动化陷阱。 5. AI发展需十年级耐心，警惕炒作泡沫，推动底层技术生态建设。 #### Reference: https://www.youtube.com/watch?v=LCEmiRjPEtQ

Sam Altman on AGI, GPT-5, and what’s next

Robotaxi vs Waymo and Cruise