webread is not getting all text on a website
Hi all,
I am trying to build a program to get the plain text from a website. The issue I am encountering is that webread does not seem to be collecting all of the text on the site (specifically, the table; see the URL in my code below). I see two options:
- query a third-party HTML-to-plain-text website that can do a better job
- determine if there is a workaround that catches more text than the webread method I am using now
Here is my code:
html = webread('https://finance.yahoo.com/q/bs?s=MXWL');
txt = regexprep(html,'<script.*?/script>','');
saveTXT(txt,'htmlplaintext');
Help is greatly appreciated; I love you guys!
Will
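For the second option, here is a minimal sketch of how the regexprep step could be extended to strip style blocks and the remaining tags, not just scripts. It runs on a small hard-coded HTML string (hypothetical content standing in for what webread returns), so the sample markup and the variable names are assumptions, not the actual Yahoo page. It also checks whether a table is present in the raw HTML at all: if the table is filled in by JavaScript after the page loads, it will not be in the string webread returns, and no amount of regex cleanup will recover it.

```matlab
% Hypothetical sample HTML standing in for the output of webread.
html = ['<html><head><style>td {color: red;}</style>' ...
        '<script>var x = 1;</script></head><body>' ...
        '<table><tr><td>Total Assets</td><td>100</td></tr></table>' ...
        '</body></html>'];

% Check whether the raw HTML contains a table at all.
% If this is false for the real page, the table is probably built client-side.
hasTable = contains(html, '<table');

% Remove script and style blocks together with their contents.
txt = regexprep(html, '<script.*?</script>', '', 'ignorecase');
txt = regexprep(txt,  '<style.*?</style>',   '', 'ignorecase');

% Replace every remaining tag with a space so adjacent cells stay separated.
txt = regexprep(txt, '<[^>]*>', ' ');

% Collapse runs of whitespace and trim the ends.
txt = strtrim(regexprep(txt, '\s+', ' '));

disp(txt)
```

With the real page, replacing the hard-coded string with the webread output and inspecting hasTable first should show whether the missing text is a parsing problem or simply not in the downloaded HTML.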
Answers (1)