咨询客服 咨询客服

A Survey on Geographically Distributed Big-Data Processing Using MapReduce

Abstract:
Hadoop and Spark are widely used distributed processing frameworks for large-scale data processing in an efficient and fault-tolerant manner on private or public clouds. These big-data processing systems are extensively used by many industries, e.g., Google, Facebook, and Amazon, for solving a large class of problems, e.g., search, clustering, log analysis, different types of join operations, matrix multiplication, pattern matching, and social network analysis. However, all these popular systems have a major drawback in terms of locally distributed computations, which prevent them in implementing geographically distributed data processing. The increasing amount of geographically distributed massive data is pushing industries and academia to rethink the current big-data processing systems. The novel frameworks, which will be beyond state-of-the-art architectures and technologies involved in the current system, are expected to process geographically distributed data at their locations without moving entire raw datasets to a single location. In this paper, we investigate and discuss challenges and requirements in designing geographically distributed data processing frameworks and protocols. We classify and study batch processing (MapReduce-based systems), stream processing (Spark-based systems), and SQL-style processing geo-distributed frameworks, models, and algorithms with their overhead issues.
Author Listing: Shlomi Dolev;Patricia Florissi;Ehud Gudes;Shantanu Sharma;Ido Singer
Volume: 5
Pages: 60-80
DOI: 10.1109/TBDATA.2017.2723473
Language: English
Journal: IEEE Transactions on Big Data

IEEE Transactions on Big Data

IEEE T BIG DATA

影响因子:7.5
是否综述期刊:否
是否OA:否
是否预警:不在预警名单内
发行时间:-
ISSN:2332-7790
发刊频率:-
收录数据库:SCIE/Scopus收录
出版国家/地区:UNITED STATES
出版社:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

期刊介绍

年发文量 118
国人发稿量 96
国人发文占比 81.36%
自引率 4.0%
平均录取率 -
平均审稿周期 -
版面费 US$2195
偏重研究方向 Multiple-
期刊官网 -
投稿链接 -

质量指标占比

研究类文章占比 OA被引用占比 撤稿占比 出版后修正文章占比
100.00% 11.22% 0.00% 0.00%

相关指数

影响因子
影响因子
年发文量
自引率
Cite Score

预警情况

时间 预警情况
2024年02月发布的2024版 不在预警名单中
2023年01月发布的2023版 不在预警名单中
2021年12月发布的2021版 不在预警名单中
2020年12月发布的2020版 不在预警名单中

JCR分区 WOS分区等级:Q1区

版本 按学科 分区
WOS期刊SCI分区
(2021-2022年最新版)
COMPUTER SCIENCE, THEORY & METHODS Q1
COMPUTER SCIENCE, INFORMATION SYSTEMS Q1

中科院分区

版本 大类学科 小类学科 Top期刊 综述期刊
计算机科学
2区
COMPUTER SCIENCE, INFORMATION SYSTEMS
计算机:信息系统
2区
COMPUTER SCIENCE, THEORY & METHODS
计算机:理论方法
2区
2021年12月
基础版
工程技术
3区
COMPUTER SCIENCE, INFORMATION SYSTEMS
计算机:信息系统
3区
COMPUTER SCIENCE, THEORY & METHODS
计算机:理论方法
2区
2021年12月
升级版
计算机科学
2区
COMPUTER SCIENCE, INFORMATION SYSTEMS
计算机:信息系统
2区
COMPUTER SCIENCE, THEORY & METHODS
计算机:理论方法
2区
2022年12月
最新升级版
计算机科学
3区
COMPUTER SCIENCE, INFORMATION SYSTEMS
计算机:信息系统
3区
COMPUTER SCIENCE, THEORY & METHODS
计算机:理论方法
2区