Vehicle Recommendation using Social media data

Extras

A Tesla automobiles store wants to predict the preferencess of potential buyers so that the right tesla variant can be recommended to maximize the probability of conversion. The data would come from 3 social media profiles of those potential buyers.

Note: Collected social media data collected after users's consent and in compliance with data privacy regulations.

Rule engine

This system will then be fine-tuned and used to recommend teslas based on queries. A business rule engine will be integrated with this system to increase accuracy.

This is how derived rules would be like:

  1. Car or truck or no mention of vehicle type means Cyber Truck
  2. SUV mention means Model X
  3. Mentions of large family or many people means model x

Input and Output

Public datasets

Primary (available for academic use only, need university affiliation for access)

Secondary (low quality data, not sure if can be used at all)

Scope

Model Framework

Model framework 1

  1. Convert user's natural language query into vector using Universal Sentence Embedding model
  2. Create a product specs binary matrix based on different categories
  3. Find TopK similar query vectors using cosine distance
  4. For each TopK vector, Find TopM product specs using interaction table weights
  5. For each TopM specification, find TopN similar specs using binary matrix
  6. Show all the qualified product specifications

Model framework 2

  1. Seed data: 10 users with ground-truth persona, media content and implicit ratings
  2. Inflated data: 10 users with media content and implicit ratings
  3. media content → Implicit rating (A)
  4. media content → feature vector (B) + (A) → weighted pooling → similar users (C)
  5. media content → QA model → slot filling → global pooling → item associations (D)
  6. (C) → content-based filtering → item recommendations → (D) → top-k recommendations

User selection

Model framework 3

User-User Similarity (clustering)

User-Item Similarity (reranking)

User-User Similarity (clustering)

User-Item Similarity (reranking)

Model framework 4

Model framework 5

  1. Topic Clusters Text
  2. Topic Clusters Image
  3. Fetch raw text and images
  4. Combine, Clean and Store text in text dataframe
  5. Vectorize Texts
  6. Cosine similarities of texts with topic clusters
  7. Vectorize Images
  8. Cosine similarities of images with topic clusters

Experiment 1

Experiment 2

Facebook Scraping

Twitter Scraping

Dataframe

Insta Image Grid

User Text NER

Experiment 3

Topic scores

JSON rules

Results and Discussion

References