In this release, we only provide the core version of IsA data we mined from billions of web pages (Other data may release in future step by step). This data contains 5,376,526 unique concepts, 12,501,527 unique instances, and 85,101,174 IsA relations.
The following is its sample data:
You need to take 30 seconds to input some simple information, and then you can download the data:
Data Mining and Enterprise Intelligence Group, MSRA
We would like to acknowledge Haixun Wang, Zhongyuan Wang, Yangqiu Song, Hongsong Li, and many interns for their contributions to the Microsoft Concept Graph and the Microsoft Concept Tagging model. Especially for Haixun Wang, he initiated and led this project when he was at Microsoft Research. We highly appreciate his tremendous contributions and insightful vision which make this project succeed finally.