RL00 - A glimpse of Reinforcement Learning
This post summarizes reinforcement learning from classic tabular methods to ML-based approximations and recent LLM applications like RLHF.
In this article, I will provide a complete walkthrough of a popular concept in data science interviews, the confidence interval, from its intuition and definition to its actual computation.
SQL (Structured Query Language) is the backbone of relational databases. This guide breaks SQL into its five command types—DQL, DML, DDL, DCL, TCL.
In this article, I will give an overall introduction to recommenders, including how the recommendation problem arises, the abstract models, and the key problems in building a new recommender.
...In this post, I will introduce the string matching problem and a clever solution, the KMP algorithm. I start from the brute-force method and show how to derive KMP.
In this post, I will introduce knowledge graphs, covering their definition, construction, storage, and applications.
In this post, I will summarize key points from a classic paper
In this note, I will give an overview of NLP's development history, focusing on how recent neural approaches have revolutionised the field.
In this post, I will summarize key points from the classic paper <BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding>, following its original structure.
...In this note, I will give an overview of cryptography and introduce the basic principles and algorithms of symmetric cryptography, asymmetric cryptography, and protocols.