apache spark - Why one RDD count job takes so much time -

- September 15, 2013

i used newapihadooprdd() method load hbase records rdd , simple count job.

however, count job takes lots of time far more can imagine. checked codes, thinking may in hbase, 1 column family has data, , when load records rdd, data may cause executors memory overflow.

is possible reason cause issue?

Search This Blog

Look

apache spark - Why one RDD count job takes so much time -

Comments

Post a Comment

Popular posts from this blog

filehandler - java open files not cleaned, even when the process is killed -

Sass watch command compiles .scss files before full sftp upload -

gridview - Yii2 DataPorivider $totalSum for a column -