I have looked around and only found solutions that take a URL and render it to HTML. However, I need a way to render a webpage that I already have (and that contains JavaScript) to its final HTML.
Want: Webpage (with JavaScript) ---> HTML
Not: URL ---> Webpage (with JavaScript) ---> HTML
I couldn't figure out how to make the other code work the way I wanted.
This is the code I was using that renders URLs: http://webscraping.com/blog/Scraping-JavaScript-webpages-with-webkit/
For clarity, the code above takes the URL of a webpage in which some parts are rendered by JavaScript, so if I scrape the page normally using, say, urllib2, I won't get all the links etc. that only appear after the JavaScript has run.
However, I want to be able to scrape a page, say again with urllib2, and then render that page and get the resulting HTML. (This is different from the above code, since that takes a URL as its argument.)
Any help is appreciated, thanks guys :)
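To make the goal concrete, here is a minimal sketch of rendering HTML that is already in hand rather than a URL. It is not the linked WebKit code: it assumes Python 3, Selenium 4, and a local Chrome install, none of which appear in the post, and the names `html_to_data_url` and `render_html` are made up for illustration. The idea is to pack the HTML string into a `data:` URL so the browser loads it without making a request for the page itself.

```python
import base64


def html_to_data_url(html):
    """Pack an HTML string into a data: URL that a browser can load directly."""
    encoded = base64.b64encode(html.encode("utf-8")).decode("ascii")
    return "data:text/html;charset=utf-8;base64," + encoded


def render_html(html):
    """Load already-downloaded HTML in a headless browser and return the DOM
    as serialized after the page's scripts have had a chance to run.

    Assumption: Selenium 4 and Chrome are installed (not part of the original post).
    """
    # Imported lazily so html_to_data_url stays usable without Selenium.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    options.add_argument("--headless")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(html_to_data_url(html))
        return driver.page_source
    finally:
        driver.quit()
```

Usage would be something like `rendered = render_html(page_text)`, where `page_text` is the string fetched earlier. One caveat: a page loaded this way has no base URL, so scripts, stylesheets, or links referenced with relative paths will probably not resolve unless the HTML is adjusted first (for example with a `<base href="...">` tag pointing at the original site).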
urllib2 code, then I might be able to refer to it with a corresponding Selenium code.
urllib2? In either case you have to send an HTTP request at some point. You can take the text file and feed it into Selenium (or any other scraping utility), but it's not going to be any different than using the URL directly.
urllib2? How is that possible???