Data Integration and Management
This research sub-field focuses on techniques and methodologies for integrating, managing, and analyzing data from diverse sources while ensuring data quality, privacy, and efficient processing. It encompasses a variety of approaches, including schema matching, data compression, clustering, and stream processing, to address the complexities of modern data environments, including applications in smart cities and health data integration.
129,536 papers
Parent topic: Communication and Signal Processing
AI-assisted content · The overview, paper groupings, and influence analysis on this page are AI-generated. They are intended as a starting point for exploring the field and may contain inaccuracies. Report an error
Sub-topics
Data Quality Assessment and Models
This cluster explores frameworks and models for assessing and managing data quality within datasets. It emphasizes the importance of ensuring that data remains accurate, consistent, and reliable, particularly in large-scale shared environments.
30228 papers
Data Compression Techniques
This area investigates various algorithms and methods for data compression, including both lossless and lossy techniques. The goal is to develop efficient algorithms that reduce data size while maintaining its integrity and utility.
20045 papers
Database Access Methods
Research in this cluster explores various methods and models for accessing databases efficiently. It covers access algorithms that optimize data retrieval processes and examines the structure and abstraction of database systems.
11358 papers
Distributed Database Management
This cluster concerns itself with the management of distributed and replicated database systems. It investigates the techniques and architectures that facilitate the storage, retrieval, and integrity of data across multiple locations.
11037 papers
Data Modeling and Representation
This area investigates various concepts and frameworks for data modeling and structural analysis. The focus is on how data is represented and organized, using models like graphs and other innovative methodologies.
8685 papers
Outlier Detection and Imputation
This area focuses on techniques for identifying outliers within datasets and methods for imputing missing data. It emphasizes the importance of ensuring data completeness and accuracy in data analysis.
8425 papers
Health Data Integration Strategies
This cluster investigates techniques and frameworks for integrating health-related data from various resources. It focuses on leveraging data for health analysis and improving health management systems.
6424 papers
Topological Data Structures
Research in this cluster focuses on the analysis and design of data structures from a topological perspective. It emphasizes understanding data organization through various hierarchical structures for improved efficiency.
5858 papers
Schema Alignment and Adaptation
This cluster focuses on techniques and methodologies for schema matching, aligning data across different formats, and managing schema evolution over time. It encompasses various approaches including automatic and flexible combinations to enhance schema interoperability.
5676 papers
Clustering Techniques and Analysis
This area examines various clustering techniques used for data analysis and pattern recognition. It addresses challenges encountered while clustering diverse datasets and enhances existing methodologies.
4698 papers
Real-time Data Stream Processing
This cluster deals with the processing of data streams in real-time, focusing on the algorithms and frameworks required to analyze continuous data streams efficiently. It underscores the challenges and innovations within streaming data systems.
4077 papers
Data Warehousing and OLAP Systems
This cluster investigates the frameworks and technologies associated with data warehousing and Online Analytical Processing (OLAP). It focuses on best practices for data analysis, storage solutions, and ensuring success in data warehousing initiatives.
3398 papers
Urban Data and Smart Systems
Research in this cluster focuses on the intersection of smart city technologies and cyber-physical systems, emphasizing data integration and analysis for urban environments. This includes the application of big data in enhancing city management and health monitoring.
2962 papers
Scientific Data Management Systems
This area focuses on frameworks and systems designed to manage scientific data effectively. It encompasses updates and methodologies for handling complex datasets often generated in scientific research activities.
2729 papers
Anonymization and Data Integrity
This area explores the methods for anonymizing sensitive data to maintain individual privacy while ensuring data utility. It also investigates detection methods for duplicate and redundant entries in databases.
2200 papers
Data Deduplication Strategies
This cluster focuses on the methodologies for reducing redundancy in datasets through effective deduplication strategies. Research includes practical approaches for system-level deduplication in data storage solutions.
2181 papers
Data Privacy and Clustering Methods
Research in this cluster emphasizes the importance of privacy in data management alongside clustering techniques for data analysis. It explores methods for detecting duplicates as well as techniques for preserving privacy in shared datasets.
2027 papers
Fuzzy Data Processing Techniques
This area delves into methodologies for processing fuzzy data, including fuzzy search algorithms and concepts for data identification. It focuses on improving accuracy and efficiency in searching and analyzing uncertain or imprecise datasets.
1947 papers
High-dimensional Clustering Algorithms
This cluster explores clustering algorithms specifically designed for high-dimensional data. Research includes methods that efficiently partition and analyze complex datasets with numerous features.
1850 papers
Log Analysis and Query Optimization
Research in this cluster predominantly deals with methods for efficiently processing and analyzing data logs, as well as optimizing query techniques for improved performance. It covers algorithms designed for transaction log analysis and other associated heuristics.
1703 papers
Papers Over Time
Top Papers
1970 · 4,690 citations
2009 · 4,556 citations
2005 · 4,320 citations
1976 · 4,289 citations
1978 · 4,067 citations
1959 · 3,583 citations
1977 · 3,468 citations
1997 · 3,092 citations
1968 · 2,669 citations
2009 · 2,461 citations
1969 · 2,424 citations
1996 · 2,420 citations
2003 · 2,242 citations
2011 · 2,207 citations
2016 · 2,104 citations
2018 · 2,058 citations
2007 · 1,999 citations
2015 · 1,909 citations
1980 · 1,826 citations
1990 · 1,745 citations