AI research, systems, and engineering notes.
Demystifying Agentic Search Engines
Agentic search engines—such as Google AI Mode, Perplexity, Bing Copilot, ChatGPT Search no longer means “type keywords, get ten blue links.” AI Search experience capable of understanding tasks, planning queries, calling tools, and synthesizing results and deliver a conversational response with inline citations, minimizing user effort. In this post, I’ll walk through the stack from bottom to top, how it crawls and indexes pages, how it retrieves and ranks information, and how recent features like RAG and Agentic search build upon these foundations.
- Nov 1, 2025
Design a Modern Recommendation System
Building Modern Recommendation Systems introduces a comprehensive, end-to-end pipeline that drives intelligent recommendations. The post walks through the full machine learning workflow — from raw data preparation and feature engineering to model training, deployment, real-time inference, and system monitoring.
#System Design#RecSys#Recommendation#ML system - Oct 24, 2025
Modeling for Modern Recommendation Systems
This post explores the full RecSys architecture, emphasizing the core models that drive each stage of the RecSys pipeline — from Retrieval for large-scale candidate generation, to Pre-ranking for efficient filtering, Ranking for fine-grained relevance modeling, and Re-ranking for balancing diversity and control.
#System Design#RecSys#Recommendation#ML system - Jun 29, 2025
The ML Factory: Building Production ML Systems
#ML#AI#System Design#ML system - Jul 9, 2025
各领域的深度学习模型
本文简要总结了深度学习在NLP、计算机视觉、信息检索和推荐系统四大主流领域的演进脉络:从早期RNN、CNN等专用模型,到Transformer全面主导,再到如今BERT/GPT、ViT、Diffusion等预训练大模型横扫各领域。核心趋势是预训练+生成式范式取代传统任务特定模型,统一建模与生成式架构正在加速推动各领域融合与新一轮创新。
#模型#AI#NLP#CV - Jan 15, 2025
大模型(LLM)关键技术:从基础到落地
#LLM#AI#大模型#Transformer - Nov 29, 2024
机器学习模型:从传统算法到生成式AI
#ML#AI - Nov 19, 2024
ML 模型生产全流程
#ML#AI#MLops - Oct 21, 2024
NLP技术与应用:从语言理解到智能生成
#NLP#AI - Oct 8, 2024
模型训练的方法与实践
#DL#ML - Aug 7, 2025
深度学习模型架构的演进
本文系统回顾了深度学习的发展脉络,从基础神经网络到Attention 与 Transformer的出现,再到深度生成模型的兴起,最后介绍了多模态与统一建模架构的发展趋势,展示了当前主流的模型体系。
#DL#ML#AI#模型 - May 29, 2024
Retrieval-Augmented Generation (RAG)
Retrieval-Augmented Generation (RAG) combines large language models with external knowledge retrieval to produce more accurate and grounded responses. The post explains why RAG was introduced, explores its key use cases and real-world applications, and discusses challenges and considerations that impact performance in practical deployments.
#LLM#AI#GenAI#NLP