Introduction

Business objective - For the given user query, recommend relevant documents (BRM_ifam).

Technical objective: 1-to-N mapping of given input text

Proposed Framework 1 - Hybrid Recommender System

Text → Vector (Universal Sentence Embedding with TF Hub)
Vector → Content-based Filtering Recommendation
Index → Interaction Matrix
Interaction Matrix → Collaborative Filtering Recommendation
Collaborative + Content-based → Hybrid Recommendation
Evaluation: Area-under-curve

Proposed Framework 2 - Content based Recommender System

Find A most similar user → Cosine similarity
For each user in A, find TopK Most Similar Items → Map Argsort
For each item Find TopL Most Similar Items → Cosine similarity
Display
Implement an evaluation metric
Evaluate

Results and Discussion

build.py → this script will take the training data as input and save all the required files in the same working directory
recommend.py → this script will take the user query as input and predict topK BRM recommendations

Variables (during recommendation, you will be asked 2-3 choices, the meaning of those choices are as following)

topK - how many top items you want to get in recommendation
secondary items: this will determine how many similar items you would like to add in consideration, for each primary matching item
sorted by frequency: since multiple input queries might point to same output, therefore this option allows to take that frequence count of outputs in consideration and will move the more frequent items at the top.

Code

https://nbviewer.jupyter.org/gist/sparsh-ai/9198a782ec01b133a26ae28d239a8ffb