This repository contains all of the code for getting the OpenAlex topic classification model up and running. Go into the model iteration directory (v1) to find a more detailed explanation of how to use this repository. To learn more about topics in OpenAlex, check out the docs.
You can find an explanation of the modeling and deployment process at the following link (work-in-progress): OpenAlex: End-to-End Process for Topic Classification
- v1 (current)
Works can be tagged with one or more of 4,516 topics, listed here. Each topic is also assigned a domain, field, and subfield that can be used to group similar topics together at a higher level. There is additional data (keywords, summary, wikipedia url) that can be used to help a user see how the clusters are distinguished and what the top words are in each topic.