• Recommendation Systems (5): Embedding and Representation Learning

    permalink: "en/recommendation-systems-5-embedding-techniques/" date: 2024-05-22 09:15:00 tags: - Recommendation Systems - Embedding - Representation Learning categories: Recommendation Systems mathjax: true --- When you browse Netflix, each movie recommendation feels personalized — not just because the algorithm knows your viewing history, but because it has learned dense vector representations (embeddings) that capture subtle relationships between movies, genres, and your preferences. These embeddings transform sparse, high-dimensional user-item interactions into compact, semantically rich vectors that enable efficient similarity search and recommendation.

    Embedding techniques form the backbone of modern recommendation systems, from Word2Vec-inspired Item2Vec that treats user sequences as "sentences," to graph-based Node2Vec that captures complex item relationships, to deep two-tower architectures like DSSM and YouTube DNN that learn separate user and item embeddings. These methods solve fundamental challenges: how to represent items and users in a way that preserves their relationships, how to handle millions of items efficiently, and how to learn from implicit feedback signals like clicks and views.

    This article provides a comprehensive exploration of embedding techniques for recommendation systems, covering theoretical foundations, sequence-based methods (Item2Vec, Word2Vec), graph-based approaches (Node2Vec), two-tower architectures (DSSM, YouTube DNN), negative sampling strategies, approximate nearest neighbor search (FAISS, Annoy, HNSW), embedding quality evaluation, and practical implementation with 10+ code examples and detailed Q&A sections.
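    The "sequences as sentences" idea behind Item2Vec can be made concrete with a few lines: treat each user's interaction sequence like a Word2Vec sentence and emit skip-gram (target, context) training pairs. This is a minimal sketch with hypothetical item IDs, not a full training loop:

```python
def skipgram_pairs(session, window=2):
    """Generate (target, context) pairs from one user session,
    treating the interaction sequence like a Word2Vec 'sentence'."""
    pairs = []
    for i, target in enumerate(session):
        lo, hi = max(0, i - window), min(len(session), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # skip the target itself
                pairs.append((target, session[j]))
    return pairs

# One user's viewing sequence (hypothetical item IDs)
session = ["matrix", "inception", "interstellar", "tenet"]
print(skipgram_pairs(session, window=1))
```

    These pairs would then feed a skip-gram model with negative sampling; in practice a library such as gensim handles the training itself.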

  • Recommendation Systems (14): Cross-Domain Recommendation and Cold-Start Solutions

    permalink: "en/recommendation-systems-14-cross-domain-cold-start/" date: 2024-07-06 15:45:00 tags: - Recommendation Systems - Cross-Domain - Cold Start categories: Recommendation Systems mathjax: true --- When Netflix launches in a new country, it faces a fundamental challenge: millions of new users with zero interaction history, and thousands of new movies with no ratings. Traditional recommendation systems, trained on historical data, fail catastrophically in this cold-start scenario. Similarly, when Amazon wants to recommend products in a new category (say, recommending books to users who've only bought electronics), it can't rely on cross-category patterns alone. These scenarios — cold-start users, cold-start items, and cross-domain recommendation — represent some of the most critical and challenging problems in modern recommendation systems.

    The cold-start problem manifests in three forms: new users with no history, new items with no interactions, and new domains with sparse data. Each requires different strategies: meta-learning that learns to learn quickly from few examples, transfer learning that adapts knowledge from related domains, and bootstrap methods that leverage auxiliary information. Cross-domain recommendation extends these ideas further, transferring patterns learned in one domain (e.g., movies) to another (e.g., books) by identifying shared structures and relationships.

    This article provides a comprehensive exploration of cross-domain recommendation and cold-start solutions, covering the taxonomy of cold-start problems, meta-learning foundations and few-shot learning principles, meta-learner architectures (MAML, Prototypical Networks), the Mecos framework for cold-start recommendation, cross-domain transfer learning frameworks, zero-shot transfer methods, graph neural network-based transfer approaches, bootstrap techniques leveraging content and social signals, and practical implementations with 10+ code examples and detailed Q&A sections addressing common challenges and design decisions.
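    The bootstrap idea — leaning on auxiliary information when no interactions exist — can be sketched as a content-similarity fallback for a brand-new item. The feature vectors here are hypothetical (e.g., genre one-hots plus a popularity score), chosen only to illustrate the mechanism:

```python
import math

def cosine(a, b):
    """Cosine similarity between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical content features for existing catalog items
catalog = {
    "item_a": [1, 0, 1, 0.9],
    "item_b": [0, 1, 0, 0.4],
    "item_c": [1, 0, 0, 0.7],
}

def bootstrap_neighbors(new_item_features, catalog, k=2):
    """Rank existing items by content similarity to a cold-start item,
    so its first recommendations can borrow their interaction history."""
    ranked = sorted(catalog, key=lambda i: cosine(new_item_features, catalog[i]),
                    reverse=True)
    return ranked[:k]

print(bootstrap_neighbors([1, 0, 1, 0.8], catalog))  # most content-similar items
```

    A production system would replace these toy vectors with learned content embeddings, but the fallback structure is the same.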

  • Recommendation Systems (2): Collaborative Filtering and Matrix Factorization

    permalink: "en/recommendation-systems-2-collaborative-filtering/" date: 2024-05-07 14:30:00 tags: - Recommendation Systems - Collaborative Filtering - Matrix Factorization categories: Recommendation Systems mathjax: true --- Imagine you've just finished watching The Shawshank Redemption and want to find a similar movie. Traditional approaches might filter by genre, director, or actors, but collaborative filtering takes a different path — it doesn't care about the movie's attributes. Instead, it observes what "users with similar taste to you also liked." If user A and user B both loved The Shawshank Redemption and Forrest Gump, and user A also enjoyed The Pursuit of Happyness, the system will recommend that movie to user B.

    Collaborative filtering is the cornerstone of recommendation systems, from the GroupLens system in the 1990s to the Netflix Prize competition, to today's personalized recommendations on major platforms. Matrix factorization is the mathematical core of collaborative filtering, decomposing the user-item rating matrix into low-dimensional vectors and using vector inner products to predict ratings. This approach solves the data sparsity problem while laying the foundation for deep learning-based recommendation systems.

    This article provides an in-depth exploration of collaborative filtering's two paradigms (User-CF and Item-CF), similarity calculation methods, the mathematical principles of matrix factorization, SVD/SVD++ algorithms, ALS optimization methods, BPR ranking learning, FM factorization machines, implicit feedback handling, bias term design, and complete Python implementation code. Each algorithm includes detailed mathematical derivations, code examples, and Q&A sections to help you master recommendation system core algorithms from theory to practice.
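    The core of matrix factorization described above — decomposing the rating matrix into low-dimensional user and item vectors whose inner products predict ratings — fits in a short SGD loop. A minimal sketch on toy data (hyperparameters are illustrative, not tuned):

```python
import numpy as np

def mf_sgd(ratings, n_users, n_items, k=2, lr=0.05, reg=0.01, epochs=1000, seed=0):
    """Factorize sparse (user, item, rating) triples into latent vectors;
    predicted rating is the inner product P[u] @ Q[i]."""
    rng = np.random.default_rng(seed)
    P = rng.normal(scale=0.1, size=(n_users, k))  # user latent factors
    Q = rng.normal(scale=0.1, size=(n_items, k))  # item latent factors
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - P[u] @ Q[i]                 # prediction error
            P[u] += lr * (err * Q[i] - reg * P[u])
            Q[i] += lr * (err * P[u] - reg * Q[i])
    return P, Q

ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0), (2, 1, 2.0)]
P, Q = mf_sgd(ratings, n_users=3, n_items=3)
print(P[0] @ Q[0])  # reconstructed rating for (user 0, item 0), near 5.0
```

    The same loop extends naturally to the bias terms and implicit-feedback variants (SVD++, ALS, BPR) covered in the article.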

  • Recommendation Systems (7): Graph Neural Networks and Social Recommendation

    permalink: "en/recommendation-systems-7-graph-neural-networks/" date: 2024-06-01 10:30:00 tags: - Recommendation Systems - GNN - Social Recommendation categories: Recommendation Systems mathjax: true --- When Netflix recommends a movie, it's not just looking at what you've watched — it's analyzing a complex web of relationships: which movies are similar to each other, which users share your taste, and how these connections form patterns across millions of interactions. This intricate network of relationships is exactly what graph neural networks (GNNs) excel at modeling. Unlike traditional recommendation approaches that treat users and items as isolated entities, GNNs capture the rich structural information encoded in user-item interaction graphs, social networks, and knowledge graphs.

    Graph neural networks have revolutionized recommendation systems by explicitly modeling the graph structure underlying collaborative filtering. From the foundational Graph Convolutional Networks (GCN) that aggregate neighbor information, to Graph Attention Networks (GAT) that learn adaptive importance weights, to GraphSAGE that enables inductive learning on large-scale graphs, GNNs provide a principled framework for leveraging relational data in recommendations. In recommendation systems, these techniques have given rise to models like PinSage (Pinterest's billion-scale recommender), LightGCN (a simplified yet powerful approach), and NGCF (Neural Graph Collaborative Filtering) that achieve state-of-the-art performance by learning high-quality embeddings through graph convolution operations.

    This article provides a comprehensive exploration of graph neural networks for recommendation, covering foundational GNN architectures (GCN, GAT, GraphSAGE), graph modeling strategies for recommendation scenarios, key models (PinSage, LightGCN, NGCF), social recommendation techniques, graph sampling methods for scalability, training strategies, implementation details with 10+ code examples, and detailed Q&A sections addressing common questions and practical challenges.
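    The "aggregate neighbor information" step is easy to see in LightGCN's simplified form: one propagation layer replaces each node's embedding with a degree-normalized sum of its neighbors' embeddings, with no feature transforms or nonlinearities. A small sketch on a toy bipartite graph:

```python
import numpy as np

# Toy bipartite interaction graph: user id -> list of interacted item ids
interactions = {0: [0, 1], 1: [1, 2], 2: [2]}

rng = np.random.default_rng(0)
user_emb = rng.normal(size=(3, 4))  # 3 users, 4-dim embeddings
item_emb = rng.normal(size=(3, 4))  # 3 items

def lightgcn_layer(user_emb, item_emb, interactions):
    """One LightGCN propagation step: each node becomes the
    1/sqrt(d_u * d_i) normalized sum of its neighbors' embeddings."""
    item_deg = {i: 0 for i in range(len(item_emb))}
    for items in interactions.values():
        for i in items:
            item_deg[i] += 1
    new_user = np.zeros_like(user_emb)
    new_item = np.zeros_like(item_emb)
    for u, items in interactions.items():
        du = len(items)
        for i in items:
            norm = 1.0 / np.sqrt(du * item_deg[i])
            new_user[u] += norm * item_emb[i]
            new_item[i] += norm * user_emb[u]
    return new_user, new_item

u1, i1 = lightgcn_layer(user_emb, item_emb, interactions)
print(u1.shape, i1.shape)
```

    Stacking several such layers and averaging them gives the final LightGCN embeddings; GCN and GAT add learned transforms and attention weights on top of this same message-passing skeleton.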

  • Recommendation Systems (6): Sequential Recommendation and Session-based Modeling

    permalink: "en/recommendation-systems-6-sequential-recommendation/" date: 2024-05-27 14:00:00 tags: - Recommendation Systems - Sequential Recommendation - Session Modeling categories: Recommendation Systems mathjax: true --- When you scroll through TikTok, each video recommendation feels like it knows exactly what you want to watch next — not because it's reading your mind, but because it's analyzing the sequence of videos you've already watched. The order matters: watching a cooking video followed by a travel video suggests different preferences than watching a travel video followed by a cooking video. This temporal dependency is what sequential recommendation captures.

    Traditional recommendation systems treat user-item interactions as static snapshots, ignoring the rich temporal patterns hidden in interaction sequences. Sequential recommendation models leverage the order of interactions to predict what users will click, purchase, or engage with next. From Markov chains that model simple transition probabilities, to RNN-based models like GRU4Rec that capture long-term dependencies, to transformer-based architectures like SASRec and BERT4Rec that model complex attention patterns, sequential recommendation has evolved into a cornerstone of modern recommendation systems.

    This article provides a comprehensive exploration of sequential recommendation, covering foundational concepts (Markov chains, session-based recommendation), neural sequence models (GRU4Rec, Caser), transformer-based architectures (SASRec, BERT4Rec, BST), graph neural networks for sessions (SR-GNN), implementation details with 10+ code examples, and detailed Q&A sections addressing common questions and challenges.
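    The simplest member of this family, the first-order Markov chain, can be sketched in a few lines: estimate P(next item | current item) from observed sessions and recommend the most likely transition. Session contents here are hypothetical:

```python
from collections import defaultdict

def fit_transitions(sequences):
    """Estimate first-order Markov transition probabilities P(next | current)
    from a collection of interaction sequences."""
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):  # consecutive pairs only
            counts[a][b] += 1
    return {a: {b: c / sum(nxt.values()) for b, c in nxt.items()}
            for a, nxt in counts.items()}

sessions = [["cook", "travel", "music"],
            ["cook", "travel", "cook"],
            ["travel", "music"]]
trans = fit_transitions(sessions)
print(max(trans["cook"], key=trans["cook"].get))  # most likely item after "cook"
```

    GRU4Rec, SASRec, and BERT4Rec generalize exactly this prediction task, conditioning on the whole sequence rather than only the last item.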

  • Recommendation Systems (16): Industrial Architecture and Best Practices

    permalink: "en/recommendation-systems-16-industrial-practice/" date: 2024-07-16 14:00:00 tags: - Recommendation Systems - Industrial Practice - System Architecture categories: Recommendation Systems mathjax: true ---

    Building production-grade recommendation systems requires navigating a complex landscape of architectural decisions, performance constraints, and business requirements. This article explores the industrial practice of recommendation systems, covering everything from multi-channel recall strategies to deployment pipelines and monitoring infrastructure.
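    One recurring industrial pattern mentioned above, multi-channel recall, amounts to merging candidate lists from several retrieval sources while deduplicating and enforcing per-channel quotas. A minimal sketch with hypothetical channel names and item IDs:

```python
def merge_recall(channels, quota):
    """Merge candidates from multiple recall channels, deduplicating
    while respecting a per-channel quota."""
    seen, merged = set(), []
    for name, candidates in channels.items():
        taken = 0
        for item in candidates:
            if item not in seen and taken < quota.get(name, 0):
                seen.add(item)
                merged.append(item)
                taken += 1
    return merged

channels = {
    "collaborative": ["i1", "i2", "i3"],
    "popular": ["i2", "i4", "i5"],
    "content": ["i1", "i6"],
}
print(merge_recall(channels, {"collaborative": 2, "popular": 2, "content": 1}))
```

    Real systems add per-channel timeouts, fallbacks, and downstream ranking, but the merge-and-dedup skeleton is the same.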

  • Recommendation Systems (10): Deep Interest Networks and Attention Mechanisms

    permalink: "en/recommendation-systems-10-deep-interest-networks/" date: 2024-06-16 15:15:00 tags: - Recommendation Systems - DIN - Attention Mechanism categories: Recommendation Systems mathjax: true --- When you browse Alibaba's e-commerce platform, the recommendation system doesn't treat all your past clicks equally. That vintage leather jacket you viewed last week matters more when you're looking at similar jackets today than the random phone charger you clicked months ago. This selective focus — understanding which historical behaviors are relevant to the current recommendation — is the core insight behind Deep Interest Networks (DIN), a breakthrough architecture that introduced attention mechanisms to recommendation systems and revolutionized how we model user interests.

    Traditional recommendation models treat user behavior sequences as fixed-length vectors, averaging or pooling all historical interactions regardless of their relevance to the current item. DIN changed this paradigm by introducing target attention: dynamically weighting historical behaviors based on their similarity to the candidate item. This simple but powerful idea, combined with Alibaba's massive scale (billions of users, millions of items, terabytes of daily data), led to significant improvements in click-through rates and revenue. The success of DIN spawned a family of attention-based architectures: DIEN (Deep Interest Evolution Network) models how interests evolve over time, DSIN (Deep Session Interest Network) captures session-level patterns, and various attention variants address different aspects of the recommendation problem.

    This article provides a comprehensive exploration of Deep Interest Networks and attention mechanisms in recommendation systems, covering the theoretical foundations of attention, DIN's target attention mechanism, DIEN's interest evolution modeling, DSIN's session-aware architecture, attention variants (multi-head, self-attention, co-attention), Alibaba's production practices and optimizations, training techniques for large-scale systems, and practical implementations with 10+ code examples and detailed Q&A sections addressing common questions and challenges.
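    The target-attention idea — weighting each historical behavior by its relevance to the candidate item — can be sketched with a simplified dot-product scorer. Note this is an illustrative simplification: the DIN paper scores each (behavior, candidate) pair with a small MLP rather than a plain dot product, and does not necessarily normalize with softmax:

```python
import numpy as np

def target_attention(history, candidate):
    """Simplified DIN-style target attention: score each historical behavior
    against the candidate, softmax the scores, and pool the weighted behaviors
    into a candidate-aware user interest vector."""
    scores = history @ candidate                  # relevance per behavior
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                      # softmax over the sequence
    return weights @ history                      # attention-weighted pooling

rng = np.random.default_rng(0)
history = rng.normal(size=(5, 8))   # 5 past behaviors, 8-dim embeddings
candidate = rng.normal(size=8)      # embedding of the item being scored
interest = target_attention(history, candidate)
print(interest.shape)
```

    The key property survives the simplification: the pooled interest vector changes with the candidate, unlike the fixed average pooling of earlier models.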

  • Recommendation Systems (11): Contrastive Learning and Self-Supervised Learning

    permalink: "en/recommendation-systems-11-contrastive-learning/" date: 2024-06-21 10:00:00 tags: - Recommendation Systems - Contrastive Learning - Self-Supervised categories: Recommendation Systems mathjax: true ---

    Traditional recommendation systems rely heavily on explicit user feedback — ratings, clicks, purchases — to learn user preferences. But what happens when this data is sparse? What if you're launching a new platform with no historical interactions? Or trying to recommend items that have never been interacted with? These cold-start problems have plagued recommendation systems for decades, limiting their effectiveness and requiring massive amounts of labeled data to achieve reasonable performance.

    Contrastive learning offers a paradigm shift. Instead of requiring explicit labels, it learns representations by contrasting similar and dissimilar examples — teaching the model that augmented views of the same user should be close in embedding space, while different users should be far apart. This self-supervised approach has revolutionized computer vision (SimCLR, MoCo), natural language processing (BERT, GPT), and now recommendation systems.

    In this comprehensive guide, we'll explore how contrastive learning transforms recommendation systems. We'll start with the fundamental principles of self-supervised learning, dive deep into landmark methods like SimCLR and SGL, examine graph augmentation strategies, explore sequential and long-tail recommendation applications, and provide extensive code examples. Whether you're building a recommendation system from scratch or improving an existing one, understanding contrastive learning is essential.
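    The "augmented views close, other examples far apart" objective is typically the InfoNCE loss. A minimal numpy sketch, where each anchor must pick out its own positive against the rest of the batch acting as negatives:

```python
import numpy as np

def info_nce(anchors, positives, temperature=0.2):
    """InfoNCE loss: each anchor should match its own positive (an augmented
    view) against all other positives in the batch acting as negatives."""
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature                 # batch x batch similarities
    logits -= logits.max(axis=1, keepdims=True)    # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))            # diagonal = matching pairs

rng = np.random.default_rng(0)
views_a = rng.normal(size=(4, 16))
views_b = views_a + 0.01 * rng.normal(size=(4, 16))  # lightly augmented views
print(info_nce(views_a, views_b))  # small loss: views align with their pairs
```

    In SGL-style recommendation, the two "views" come from graph augmentations (edge or node dropout) of the same user or item rather than image transforms.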

  • Recommendation Systems (15): Real-Time Recommendation and Online Learning

    permalink: "en/recommendation-systems-15-real-time-online/" date: 2024-07-11 10:15:00 tags: - Recommendation Systems - Real-Time - Online Learning categories: Recommendation Systems mathjax: true --- In the era of instant gratification, recommendation systems face an unprecedented challenge: users expect personalized suggestions that adapt to their current interests within milliseconds. Traditional batch-based approaches, which retrain models every few hours or days, simply cannot keep pace with rapidly changing user preferences, trending content, or contextual shifts. This article explores real-time recommendation architectures and online learning algorithms that enable systems to learn and adapt continuously from streaming data, making decisions in real-time while balancing exploration and exploitation.
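    The contrast with batch retraining can be seen in a minimal online learner: a logistic-regression CTR model that updates its weights on every streaming click event instead of waiting for the next offline training run. A toy sketch with a two-feature stream:

```python
import math

class OnlineLogistic:
    """Per-event SGD logistic regression: the model updates on every
    streaming (features, clicked) example as it arrives."""
    def __init__(self, n_features, lr=0.1):
        self.w = [0.0] * n_features
        self.lr = lr

    def predict(self, x):
        z = sum(wi * xi for wi, xi in zip(self.w, x))
        return 1.0 / (1.0 + math.exp(-z))  # click probability

    def update(self, x, clicked):
        err = clicked - self.predict(x)    # gradient of the log loss
        self.w = [wi + self.lr * err * xi for wi, xi in zip(self.w, x)]

model = OnlineLogistic(n_features=2)
# Simulated stream: feature 0 always clicked, feature 1 never clicked
stream = [([1.0, 0.0], 1), ([0.0, 1.0], 0)] * 100
for x, y in stream:
    model.update(x, y)
print(model.predict([1.0, 0.0]), model.predict([0.0, 1.0]))
```

    Production online learners (e.g., FTRL) add per-coordinate learning rates and sparsity-inducing regularization, but the event-by-event update loop is the defining difference from batch training.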

  • Recommendation Systems (12): Large Language Models and Recommendation

    permalink: "en/recommendation-systems-12-llm-recommendation/" date: 2024-06-26 14:30:00 tags: - Recommendation Systems - LLM - Large Language Models categories: Recommendation Systems mathjax: true --- When you ask ChatGPT "What movies should I watch if I liked The Matrix?" it doesn't just match keywords — it understands the philosophical themes, visual style, and narrative structure that made The Matrix compelling, then reasons about similar films across genres and decades. This capability represents a paradigm shift in recommendation systems: moving from statistical pattern matching to semantic understanding and reasoning. Large Language Models (LLMs) like GPT-4, Claude, and LLaMA have revolutionized natural language processing, and their integration into recommendation systems is transforming how we understand user preferences, generate explanations, and handle cold-start scenarios.

    Traditional recommendation systems excel at finding patterns in user-item interaction matrices, but they struggle with understanding rich textual content, explaining recommendations naturally, and adapting to conversational contexts. LLMs bridge these gaps by bringing world knowledge, semantic understanding, and natural language generation to recommendation. From prompt-based zero-shot recommendation that requires no training data, to LLM-enhanced feature extraction that enriches item representations, to conversational recommenders that engage users in natural dialogue, LLMs are reshaping the recommendation landscape.

    This article provides a comprehensive exploration of LLM-powered recommendation systems, covering the fundamental roles LLMs play (as rankers, feature enhancers, and conversational agents), prompt engineering techniques for recommendation tasks, state-of-the-art architectures like A-LLMRec and XRec, conversational recommendation systems (ChatREC, RA-Rec, ChatCRS), token efficiency optimization strategies, and practical implementations with 10+ code examples and detailed Q&A sections addressing common challenges and design decisions.
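    Prompt-based zero-shot recommendation starts with nothing more than assembling the user's history and the candidate pool into a ranking prompt. The wording below is illustrative, not a template prescribed by any of the systems named above:

```python
def build_rec_prompt(liked_items, candidates, k=3):
    """Assemble a zero-shot ranking prompt for an LLM from a user's liked
    items and a candidate list (illustrative template)."""
    liked = "\n".join(f"- {t}" for t in liked_items)
    cands = "\n".join(f"{i + 1}. {t}" for i, t in enumerate(candidates))
    return (
        "You are a movie recommendation assistant.\n"
        f"The user enjoyed these movies:\n{liked}\n\n"
        f"Rank the {k} best matches from the following candidates, "
        "and briefly explain each choice:\n"
        f"{cands}\n"
    )

prompt = build_rec_prompt(
    ["The Matrix", "Inception"],
    ["Blade Runner 2049", "Notting Hill", "Tenet", "Paddington"],
)
print(prompt)
```

    The resulting string would be sent to an LLM API; numbering the candidates makes the model's ranked answer easy to parse, and keeping the candidate list short is one of the token-efficiency strategies the article discusses.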