Architecture diagram
The following diagram shows the overall framework of the cold start pipeline:

Cold start pipeline workflow
The app requests a list of recommended content from the recommendation service.
The recommendation service calls the
cold start recall moduleto retrieve matching candidate items.The recommendation service calls the
cold start algorithm scoring EAS serviceand passes parameters such as the candidate items to score, user/item features, and the number of items to return.The
cold start algorithm scoring EAS servicegenerates context feature vectors based on thefeature generation configuration file. This process may require retrieving item features from the feature storage system.The
cold start algorithm scoring EAS servicesynchronizes and caches model parameters from themodel parameter storage system, such as a Hologres database.The
cold start algorithm scoring EAS servicecalls the model to calculate a prediction score for each candidate item. This step applies anE&Estrategy to balance exploration and exploitation.The
cold start algorithm scoring EAS serviceselects the top N candidate items based on the scores from the previous step.The
cold start algorithm scoring EAS servicewrites the feature vectors of the top N candidate items to thefeature storage system(backend log storage) in real time, providing data for subsequent model training.The
cold start algorithm scoring EAS servicereturns the top N candidate items to the recommendation service.The recommendation service blends the N cold start items with the regular recommendation list and returns the combined results to the app for display.
The app uploads behavior logs, such as impressions and clicks, to a log server in real time.
A Flink task reads the real-time behavior logs from message middleware, such as Kafka or DataHub.
The Flink task parses the logs, filters the behavior logs for the target scenario based on its configuration, and uses these logs to build samples and train the model.
After a short delay, the Flink task creates sample labels based on the behavior type and retrieves the sample feature vector from the
feature storage system.The Flink task trains the model using online learning and synchronizes the model parameters to the
model parameter storage systemin real time, making the parameters available to thecold start algorithm scoring EAS service.
Data flow view
