📄 arXiv Latest Papers (29 items)
👤 Thibault Bañeras-Roux, Shashi Kumar, Driss Khalil, Sergio Burdisso et al.
📂 cs.CL
Automatic Speech Recognition (ASR) is traditionally evaluated using Word Error Rate (WER), a metric that is insensitive to meaning. Embedding-based semantic metrics are better correlated with human pe...
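To make the "insensitive to meaning" point concrete, here is a minimal sketch of the standard WER metric the abstract refers to (word-level edit distance normalized by reference length); this is an illustrative implementation, not code from the paper:

```python
# Word Error Rate (WER): minimum edit distance between reference and
# hypothesis word sequences, divided by the reference length.
def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = edits needed to turn ref[:i] into hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # deletions
    for j in range(len(hyp) + 1):
        d[0][j] = j  # insertions
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / len(ref)

# WER is blind to meaning: a synonym and a nonsense word are both
# counted as one substitution.
print(wer("the cat sat", "the feline sat"))  # one substitution -> 1/3
print(wer("the cat sat", "the xqzt sat"))    # nonsense word -> same 1/3
```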
👤 Zhiqiu Xu, Shibo Jin, Shreya Arya, Mayur Naik
📂 cs.CL, cs.SE
As frontier language models attain near-ceiling performance on static mathematical benchmarks, existing evaluations are increasingly unable to differentiate model capabilities, largely because they ca...
👤 Bartosz Balis, Michal Orzechowski, Piotr Kica, Michal Dygas et al.
📂 cs.AI
Scientific workflow systems automate execution -- scheduling, fault tolerance, resource management -- but not the semantic translation that precedes it. Scientists still manually convert research ques...
👤 Chee Wei Tan, Yuchen Wang, Shangxin Guo
📂 cs.AI
This paper introduces a new paradigm for AI game programming, leveraging large language models (LLMs) to extend and operationalize Claude Shannon's taxonomy of game-playing machines. Central to this p...
👤 Praval Sharma, Ashok Samal, Leen-Kiat Soh, Deepti Joshi
📂 cs.CL
Event extraction identifies the central aspects of events from text. It supports event understanding and analysis, which is crucial for tasks such as informed decision-making in emergencies. Therefore...
👤 Jun Wang, Ziyin Zhang, Rui Wang, Hang Yu et al.
📂 cs.CL, cs.AI, cs.LG
Real-time detection and mitigation of technical anomalies are critical for large-scale cloud-native services, where even minutes of downtime can result in massive financial losses and diminished user ...
👤 Yuto Nishida, Naoki Shikoda, Yosuke Kishinami, Ryo Fujii et al.
📂 cs.CL
Understanding what kinds of factual knowledge large language models (LLMs) memorize is essential for evaluating their reliability and limitations. Entity-based QA is a common framework for analyzing n...
👤 Jiseon Kim, Jea Kwon, Luiz Felipe Vecchietti, Wenchao Dong et al.
📂 cs.CL
Human moral judgment is context-dependent and modulated by interpersonal relationships. As large language models (LLMs) increasingly function as decision-support systems, determining whether they enco...
👤 Naheed Rayhan, Sohely Jahan
📂 cs.CR, cs.AI
Large language models (LLMs) are increasingly integrated into sensitive workflows, raising the stakes for adversarial robustness and safety. This paper introduces Transient Turn Injection (TTI), a new ...
👤 Haolin Zhang, William Reber, Yuxuan Zhang, Guofei Gu et al.
📂 cs.CR, cs.AI
Modern phishing campaigns increasingly evade snapshot-based URL classifiers using interaction gates (e.g., checkbox/slider challenges), delayed content rendering, and logo-less credential harvesters. ...
👤 Anuj Sadani, Deepak Kumar
📂 cs.AI
The Model Context Protocol (MCP) has become a common interface for connecting large language model (LLM) agents to external tools, but its reliance on stateless, eager schema injection imposes a hidde...
👤 Ye Yu, Heming Liu, Haibo Jin, Xiaopeng Yuan et al.
📂 cs.AI, cs.CL, cs.MA
Multi-agent systems built on large language models have shown strong performance on complex reasoning tasks, yet most work focuses on agent roles and orchestration while treating inter-agent communica...
👤 Hans Ole Hatzel, Ekaterina Artemova, Haimo Paul Stiemer, Evelyn Gius et al.
📂 cs.CL
We present the shared task on narrative similarity and narrative representation learning - NSNRL (pronounced "nass-na-rel"). The task operationalizes narrative similarity as a binary classification pr...
👤 Minji Jung, Minjae Lee, Yejin Kim, Sarang Choi et al.
📂 cs.AI, cs.CY, cs.HC
LLM leaderboards are widely used to compare models and guide deployment decisions. However, leaderboard rankings are shaped by evaluation priorities set by benchmark designers, rather than by the dive...
👤 Guangxiang Zhao, Qilong Shi, Xusen Xiao, Xiangzheng Zhang et al.
📂 cs.AI
Reasoning LLMs often spend substantial tokens on long intermediate reasoning traces (e.g., chain-of-thought) when solving new problems. We propose to summarize and store reusable reasoning skills dist...
👤 Joseba Fernandez de Landa, Carla Perez-Almendros, Jose Camacho-Collados
📂 cs.CL, cs.AI, cs.CY
LLMs have been showing limitations when it comes to cultural coverage and competence, and in some cases show regional biases such as amplifying Western and Anglocentric viewpoints. While there have be...
👤 Buqiang Xu, Yijun Chen, Jizhan Fang, Ruobin Zhong et al.
📂 cs.CL, cs.AI, cs.IR, cs.LG, cs.MA
Long-term conversational agents need memory systems that capture relationships between events, not merely isolated facts, to support temporal reasoning and multi-hop question answering. Current approa...
👤 Wujiang Xu, Jiaojiao Han, Minghao Guo, Kai Mei et al.
📂 cs.CL, cs.AI, cs.CE
LLM agents increasingly operate in open-ended environments spanning hundreds of sequential episodes, yet they remain largely stateless: each task is solved from scratch without converting past experie...
👤 Minh Duc Bui, Xenia Heilmann, Mattia Cerrato, Manuel Mager et al.
📂 cs.CL, cs.SE
Prior work evaluates code generation bias primarily through simple conditional statements, which represent only a narrow slice of real-world programming and reveal solely overt, explicitly encoded bia...
👤 Jiali Wei, Ming Fan, Guoheng Sun, Xicheng Zhang et al.
📂 cs.CR, cs.AI, cs.CL
The growing application of large language models (LLMs) in safety-critical domains has raised urgent concerns about their security. Many recent studies have demonstrated the feasibility of backdoor at...
👤 Qizhuo Xie, Yunhui Liu, Yu Xing, Qianzi Hou et al.
📂 cs.AI, cs.CL
Large Language Models (LLMs) have shown immense potential in Knowledge Graph Completion (KGC), yet bridging the modality gap between continuous graph embeddings and discrete LLM tokens remains a criti...
👤 Lester James V. Miranda, Songbo Hu, Roi Reichart, Anna Korhonen
📂 cs.CL, cs.CY
Where and how language models (LMs) are deployed determines who can benefit from them. However, there are several challenges that prevent effective deployment of LMs in non-English-speaking and hardwa...
👤 Hao-Yuan Chen
📂 cs.CL, cs.AI
Inference-time scaling for LLM reasoning has focused on three axes: chain depth, sample breadth, and learned step-scorers (PRMs). We introduce a fourth axis, granularity of external verbal supervision...
👤 Kaushitha Silva, Srinath Perera
📂 cs.SE, cs.AI
Multi-agent frameworks are widely used in autonomous code generation and have applications in complex algorithmic problem-solving. Recent work has addressed the challenge of generating functionally co...
👤 Linjuan Wu, Haoran Wei, Jialong Tang, Shuang Luo et al.
📂 cs.CL
As LLMs reduce English-centric bias, a surprising trend emerges: non-English responses sometimes outperform English on reasoning tasks. We hypothesize that language functions as a latent variable that...
👤 Maximilian Westermann, Ben Griffin, Aaron Ontoyin Yin, Zakari Salifu et al.
📂 cs.AI, cs.CE, cs.LG
Feature discovery from complex unstructured data is fundamentally a reasoning problem: it requires identifying abstractions that are predictive of a target outcome while avoiding leakage, proxies, and...
👤 Milan De Koning, Ali Asgari, Pouria Derakhshanfar, Annibale Panichella
📂 cs.SE, cs.AI
LLM-based automated program repair (APR) techniques have shown promising results in reducing debugging costs. However, prior results can be affected by data leakage: large language models (LLMs) may m...
👤 Chris Schneider, Philipp Schoenegger, Ben Bariach
📂 cs.AI, cs.LG
Current model training approaches incorporate user information directly into shared weights, making individual data removal computationally infeasible without retraining. This paper presents a three-l...
👤 Rodrigo Nogueira, Giovana Kerche Bonás, Thales Sales Almeida, Andrea Roque et al.
📂 cs.CL
Large language models increasingly shape the information people consume: they are embedded in search, consulted for professional advice, deployed as agents, and used as a first stop for questions abou...
🤗 HuggingFace Trending Papers (20 items)
👤 Yueyang Ding, HaoPeng Zhang, Rui Dai · 👍 75
Comprehensive understanding of time series remains a significant challenge for Large Language Models (LLMs). Current research is hindered by fragmented task definitions and benchmarks with inherent am...
👤 Kanzhi Cheng, Zehao Li, Zheng Ma · 👍 27
Mobile agents powered by vision-language models have demonstrated impressive capabilities in automating mobile tasks, with recent leading models achieving a marked performance leap, e.g., nearly 70% s...
👤 Chaitanya Dwivedi, Binxuan Huang, Himanshu Gupta · 👍 15
Mixture-of-Experts (MoE) has become the dominant architecture for scaling large language models: frontier models routinely decouple total parameters from per-token computation through sparse expert ro...
👤 Yen-Siang Wu, Rundong Luo, Jingsen Zhu · 👍 13
How can we tell whether a video has been sped up or slowed down? How can we generate videos at different speeds? Although videos have been central to modern computer vision research, little attention ...
👤 Xiyang Wu, Zongxia Li, Guangyao Shi · 👍 10
Long-horizon interactive environments are a testbed for evaluating agents' skill-usage abilities. These environments demand multi-step reasoning, the chaining of multiple skills over many timesteps, an...
👤 Wenhong Zhu, Ruobing Xie, Rui Wang · 👍 9
Knowledge distillation (KD) is a powerful paradigm for compressing large language models (LLMs), whose effectiveness depends on intertwined choices of divergence direction, optimization strategy, and ...
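The "divergence direction" choice mentioned above can be illustrated with a toy example: forward KL(teacher || student) is mass-covering, while reverse KL(student || teacher) is mode-seeking. The distributions below are hypothetical and not from the paper:

```python
import math

# KL divergence over a small discrete vocabulary.
def kl(p, q):
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

teacher = [0.7, 0.2, 0.1]  # peaked teacher distribution
student = [0.4, 0.3, 0.3]  # flatter student distribution

# Forward KL penalizes the student for missing teacher mass (mass-covering);
# reverse KL penalizes the student for placing mass where the teacher has
# little (mode-seeking). The two objectives pull training differently.
forward = kl(teacher, student)
reverse = kl(student, teacher)
print(forward, reverse)
```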
👤 Jun Wang, Ziyin Zhang, Rui Wang · 👍 9
Real-time detection and mitigation of technical anomalies are critical for large-scale cloud-native services, where even minutes of downtime can result in massive financial losses and diminished user ...
👤 Skylar Zhai, Jingcheng Liang, Dongyeop Kang · 👍 8
Reinforcement fine-tuning improves the reasoning ability of large language models, but it can also encourage them to answer unanswerable queries by guessing or hallucinating missing information. Exist...
👤 Valentin Gabeur, Shangbang Long, Songyou Peng · 👍 7
Recent works show that image and video generators exhibit zero-shot visual understanding behaviors, in a way reminiscent of how LLMs develop emergent capabilities of language understanding and reasoni...
👤 Qijun Han, Haoqin Tu, Zijun Wang · 👍 5
Autonomous GUI agents face two fundamental challenges: early stopping, where agents prematurely declare success without verifiable evidence, and repetitive loops, where agents cycle through the same f...
👤 Ceyuan Yang, Zhijie Lin, Yang Zhao · 👍 5
We present Omni, a unified multimodal model natively trained on diverse modalities, including text, images, videos, 3D geometry, and hidden representations. We find that such training enables Context ...
👤 Juyong Jiang, Chenglin Cai, Chansung Park · 👍 3
While Large Language Models (LLMs) excel at function-level code generation, project-level tasks such as generating functional and visually aesthetic multi-page websites remain highly challenging. Exis...
👤 Yanran Zhang, Wenzhao Zheng, Yifei Li · 👍 3
In recent years, significant progress has been made in both image generation and generated image detection. Despite their rapid, yet largely independent, development, these two fields have evolved dis...
👤 Hardy Chen, Nancy Lau, Haoqin Tu · 👍 3
Frontier coding agents are increasingly used in workflows where users supervise progress primarily through repeated improvement of a public score, namely the reported score on a public evaluation file...
👤 Vipula Rawte, Ryan Rossi, Franck Dernoncourt · 👍 2
Large Language Models (LLMs) have demonstrated remarkable fluency and versatility across a wide range of NLP tasks, yet they remain prone to factual inaccuracies and hallucinations. This limitation po...
👤 Noah Flynn · 👍 2
Large language models (LLMs) often exhibit performance disparities across languages, with naive multilingual fine-tuning frequently degrading performance due to negative cross-lingual interference. To...
👤 Benjamin K. Johnson, Thomas Goralski, Ayush Semwal · 👍 2
Semi-Markov Conditional Random Fields (semi-CRFs) assign labels to segments of a sequence rather than to individual positions, enabling exact inference over segment-level features and principled uncer...
👤 Yao Zhang, Zhuchenyang Liu, Thomas Ploetz · 👍 1
The world knowledge and reasoning capabilities of text-based large language models (LLMs) are advancing rapidly, yet current approaches to human motion understanding, including motion question answeri...
👤 Jaechul Roh, Amir Houmansadr · 👍 1
Prior work shows that fine-tuning aligned models on benign data degrades safety in text and vision modalities, and that proximity to harmful content in representation space predicts which samples caus...
👤 Mikhail Menschikov, Dmitry Evseev, Victoria Dochkina · 👍 0
Personalizing language models by effectively incorporating user interaction history remains a central challenge in the development of adaptive AI systems. While large language models (LLMs), combined ...
🔥 GitHub Trending Projects (7 items)
⭐ 8,845 (+706 today) · TypeScript
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
⭐ 2,821 (+228 today) · Python
Build your own AI SRE agents. The open source toolkit for the AI era ✨
⭐ 107,319 (+203 today) · Python
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
⭐ 31,240 (+188 today) · Python
LLM-powered stock analysis system for A-share/H-share/US markets: multi-source market data + real-time news + an LLM decision dashboard + multi-channel push notifications; runs on a schedule at zero cost using only free services.
⭐ 7,772 (+49 today) · Python
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
⭐ 25,535 (+20 today) · Python
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
⭐ 8,965 (+17 today) · Python
A collection of sample agents built with Agent Development Kit (ADK)