How to Separte table data from html?

2 views (last 30 days)
Bilal qureshi
Bilal qureshi on 6 Jun 2018
Edited: Paolo on 6 Jun 2018
i am doing web reading using Http POST method with urlread function. how to separate html Table data from html?(str)
ur='https://www.w3schools.com/html/html_tables.asp';
str=urlread(ur)

Accepted Answer

Paolo
Paolo on 6 Jun 2018
Edited: Paolo on 6 Jun 2018
You should be using webread rather than urlread as stated in the documentation. You can use regexp to extract HTML elements from a HTML document. There are examples on how to do so on the regexp documentation page.
Use the following code to extract < table > elements.
ur = 'https://www.w3schools.com/html/html_tables.asp';
str = webread(ur);
%Using urlread instead will give the same output.
%str = urlread(ur);
expression = '<(table).*?</\1>';
matches = regexp(str,expression,'match');
matches contains the two < table > elements in the HTML.

More Answers (0)

Categories

Find more on Data Import and Export in Help Center and File Exchange

Products


Release

R2017a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!