How do I scrape a website using MATLAB?
Show older comments
I want to scrape the text data from a table on a website. I have used urlfilter but this only scrapes numbers.
Answers (1)
Image Analyst
on 18 Jul 2014
Edited: Image Analyst
on 18 Jul 2014
1 vote
Use urlread() to get everything.
4 Comments
azizullah khan
on 18 Jul 2014
Image Analyst
on 18 Jul 2014
You mean urlread gets only numbers for you? For me it gets everything, just as if you'd right clicked on the page and said "View Source". I know it's HTML and it will look like a mess and you'll have to parse it out, but what do you expect?
azizullah khan
on 19 Jul 2014
Image Analyst
on 20 Jul 2014
I don't have anything beyond urlread() for that. There is not going to be any built-in code that know exactly how your 200 web pages are laid out and how to extract the table from all the rest of the HTML code that is retrieved. There are billions of web pages out there and they just don't a page parser for all of them, or even yours. You're going to have to do that yourself.
Categories
Find more on Web Services in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!