Natural Language Processing

As the name suggests, the source data for these models is text: track metadata, news articles, blogs, and other writing from around the internet.

At a very high level, Spotify constantly crawls the web for blog posts and other written text about music to learn what people are saying about specific artists and songs: which adjectives and other language frequently appear in reference to them, and which other artists and songs are discussed alongside them.
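To make the idea concrete, here is a minimal sketch of that kind of text analysis. It is not Spotify's actual pipeline: the artist list, the stopword set, the sample documents, and the `profile_artists` function are all hypothetical, and raw counts stand in for whatever weighting a production system would use.

```python
import re
from collections import Counter, defaultdict

# Hypothetical inputs: a known artist list and a tiny stand-in for crawled text.
ARTISTS = ["Daft Punk", "Justice", "Radiohead"]
STOPWORDS = {"the", "a", "an", "and", "of", "is", "are", "with", "to", "in"}
DOCS = [
    "Daft Punk and Justice are funky French electronic acts.",
    "Radiohead is moody and experimental.",
    "The funky electronic sound of Daft Punk is iconic.",
]


def profile_artists(docs, artists, stopwords=STOPWORDS):
    """Count descriptive terms and co-mentioned artists for each artist."""
    terms = defaultdict(Counter)        # artist -> words seen near that artist
    co_mentions = defaultdict(Counter)  # artist -> other artists in the same text
    for doc in docs:
        text = doc.lower()
        # Naive substring match (an assumption for brevity); a real system
        # would need proper entity recognition to avoid false matches.
        mentioned = [a for a in artists if a.lower() in text]
        # Strip the artist names so they are not counted as descriptive terms.
        remainder = text
        for a in mentioned:
            remainder = remainder.replace(a.lower(), " ")
        words = [w for w in re.findall(r"[a-z']+", remainder) if w not in stopwords]
        for a in mentioned:
            terms[a].update(words)
            co_mentions[a].update(b for b in mentioned if b != a)
    return terms, co_mentions


terms, co_mentions = profile_artists(DOCS, ARTISTS)
```

Calling `terms["Daft Punk"].most_common(3)` lists the words that appear most often in text mentioning that artist, and `co_mentions["Daft Punk"]` shows which other artists are discussed in the same documents. Treating each document as a single context window is a simplification; sentence-level or windowed co-occurrence would be a natural refinement.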
