hadoop - Data loading into Hive Table by HDFS vs Local Files -


if loading data hdfs hive tables, advantage on loading data local file? if load data hfds hive, isn't data replication in hdfs?

local hdfs slower single huge chunk of data trancefer form local remote n number of nodes.

there replication of data if copy hdfs file hive tables , thats default functionality hive manage own directory, if dont want duplication of data please check answer : is possible import data hive table without copying data


Comments

Popular posts from this blog

html - Outlook 2010 Anchor (url/address/link) -

javascript - Why does running this loop 9 times take 100x longer than running it 8 times? -

Getting gateway time-out Rails app with Nginx + Puma running on Digital Ocean -