Reading HDFS from Matlab - what toolboxes do I need?
We're planning to implement Hadoop at my work, and I need a way to retrieve data from the Hadoop clusters in the data lake and get it into MATLAB. What toolboxes do I need for this? Note that I'm only reading data from files stored in HDFS.
Additionally, would I need other toolboxes to be able to read data?
Answers (2)
Brandon Eidson
on 11 Sep 2017
Hadoop Sequence Files can be read directly in base MATLAB.
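As a rough sketch of what reading from HDFS in base MATLAB can look like, the snippet below uses the datastore function. The Hadoop install folder, host name, port, and file paths are hypothetical placeholders, and the sequence-file part assumes the files were written by MATLAB mapreduce.

```matlab
% Sketch only: the install folder, host, port, and paths are hypothetical.
% MATLAB needs to know where a local Hadoop installation lives.
setenv('HADOOP_HOME', '/opt/hadoop');

% Point a datastore at a delimited text file stored in HDFS.
ds = datastore('hdfs://namenode.example.com:8020/data/lake/records.csv');

% Look at the first few rows, then read the first chunk into a table.
preview(ds)
T = read(ds);

% Sequence files (assumed here to come from MATLAB mapreduce) can be
% read as key-value pairs by requesting a key-value datastore.
kvds = datastore('hdfs://namenode.example.com:8020/results/', ...
    'Type', 'keyvalue');
kv = readall(kvds);
```

Reading in chunks with read (or wrapping the datastore in a tall array) avoids pulling a whole data set into memory at once, which is usually the point of keeping the data in HDFS.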
If you want to run "mapreduce" on a Hadoop cluster, then you need licenses for Parallel Computing Toolbox and MATLAB Distributed Computing Server. Documentation on how to configure a Hadoop cluster and run "mapreduce" on it is linked below.
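For illustration, a minimal sketch of running "mapreduce" against a Hadoop cluster might look like the following. The install folder, host name, port, paths, and the two function names (countRowsMapper, sumReducer) are hypothetical, and the cluster is assumed to be already configured for MATLAB.

```matlab
% Sketch only: install folder, host, port, and paths are hypothetical.
% Requires Parallel Computing Toolbox and MATLAB Distributed Computing Server.
setenv('HADOOP_HOME', '/opt/hadoop');
cluster = parallel.cluster.Hadoop('HadoopInstallFolder', '/opt/hadoop');
mr = mapreducer(cluster);   % make the Hadoop cluster the execution environment

% Input data in HDFS, read as tabular text.
ds = datastore('hdfs://namenode.example.com:8020/data/lake/records.csv');

% Count the rows of the input; results are written back to HDFS.
outds = mapreduce(ds, @countRowsMapper, @sumReducer, mr, ...
    'OutputFolder', 'hdfs://namenode.example.com:8020/tmp/rowcount_out');
result = readall(outds);

% Shown as local functions for brevity; saving each one in its own file
% on the MATLAB path is the more conservative choice for cluster runs.
function countRowsMapper(data, ~, intermKVStore)
    % Emit the number of rows in this chunk under a single key.
    add(intermKVStore, 'rowCount', height(data));
end

function sumReducer(key, intermValIter, outKVStore)
    % Add up the per-chunk counts for the key.
    total = 0;
    while hasnext(intermValIter)
        total = total + getnext(intermValIter);
    end
    add(outKVStore, key, total);
end
```

Swapping `mr` for `mapreducer(0)` runs the same mapper and reducer in the local MATLAB session, which can be handy for checking the logic before involving the cluster.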
Chad Greene
on 11 Sep 2017
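The h5read function has come standard since MATLAB release R2011a, and requires no special toolboxes. Note, though, that h5read reads HDF5 files, a file format that is unrelated to the Hadoop Distributed File System (HDFS).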
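A minimal sketch of an h5read call, using the example.h5 sample file that ships with MATLAB; the dataset path is taken from that sample file and would differ for other files.

```matlab
% Show the structure of the sample HDF5 file that ships with MATLAB.
h5disp('example.h5');

% Read one dataset from the file into a MATLAB array.
lat = h5read('example.h5', '/g4/lat');
```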