Abstract
Recent years have seen a significant shift in Artificial Intelligence from model-centric to data-centric approaches, highlighted by the success of large foundational models. Following this trend, despite numerous innovations in graph machine learning model design, graph-structured data often suffers from data quality issues, jeopardizing the progress of Data-centric AI in graph-structured applications. Our proposed tutorial addresses this gap by raising awareness about data quality issues within the graph machine-learning community. We provide an overview of existing topology, imbalance, bias, limited data, and abnormality issues in graph data. Additionally, we highlight recent developments in foundational graph models that focus on identifying, investigating, mitigating, and resolving these issues.
| Original language | English (US) |
|---|---|
| Title of host publication | CIKM 2024 - Proceedings of the 33rd ACM International Conference on Information and Knowledge Management |
| Publisher | Association for Computing Machinery |
| Pages | 5534-5537 |
| Number of pages | 4 |
| ISBN (Electronic) | 9798400704369 |
| DOIs | |
| State | Published - Oct 21 2024 |
| Event | 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024 - Boise, United States Duration: Oct 21 2024 → Oct 25 2024 |
Publication series
| Name | International Conference on Information and Knowledge Management, Proceedings |
|---|---|
| ISSN (Print) | 2155-0751 |
Conference
| Conference | 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024 |
|---|---|
| Country/Territory | United States |
| City | Boise |
| Period | 10/21/24 → 10/25/24 |
Funding
This research is supported by the National Science Foundation (NSF) under grant number IIS2239881 and Adobe Research.
Keywords
- data-centric artificial intelligence
- graph machine learning
ASJC Scopus subject areas
- General Business, Management and Accounting
- General Decision Sciences