grab html data without rss
Name:
Anonymous
2007-10-14 15:42
hey nerds. Hopefully some of you will help me.
I want to get dynamic HTML text off a webpage that doesn't have an RSS feed... can this be done?
Name:
Anonymous
2007-10-14 15:46
Yes it can, use your fucking brain.
Name:
Anonymous
2007-10-14 17:05
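var req = new XMLHttpRequest();
// open() takes the method first, then the URL; false makes the request synchronous
req.open("GET", "page.html", false);
req.send(null);
// responseText is only complete at this point because the request is synchronous
var page_content = req.responseText;
// do some regex shit on page_content here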
Name:
Anonymous
2007-10-14 18:01
>>3
OP obviously wants to scrape shit from a remote website. XMLHttpRequest is same-origin only.
Fuckwit.
Name:
Anonymous
2007-10-14 18:10
The libwww-perl library is your friend.
Name:
Anonymous
2007-10-14 18:32
Easy but retarded way:
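<?php
$fp = fsockopen("whatthefuckever.com", 80);
if (!$fp) { die("could not connect"); }
fwrite($fp, "GET /blahblah.html\r\nHost: whatthefuckever.com\r\nConnection: Close\r\n\r\n");
$shit = ""; // initialize before appending to it
while (!feof($fp)) { $shit .= fread($fp, 128); }
fclose($fp);
// do something with $shit
?>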
Name:
Anonymous
2007-10-14 18:33
>>6
GET request string needs HTTP version in it. OP: Eat a dick.
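For reference, a minimal sketch of a request string with the HTTP version in the request line. The function name buildGetRequest is made up for this example, and HTTP/1.0 is an assumption picked here so the server does not answer with a chunked body; host and path are the placeholders from >>6.

```javascript
// Build a minimal GET request. buildGetRequest is a hypothetical helper,
// not part of any library.
function buildGetRequest(host, path) {
  return [
    `GET ${path} HTTP/1.0`, // request line: method, path, HTTP version
    `Host: ${host}`,        // lets the server pick the right virtual host
    "Connection: close",    // ask the server to close the socket when done
    "",                     // blank line ends the header block
    "",
  ].join("\r\n");
}

// Show the exact bytes, with \r\n escaped so they are visible.
console.log(JSON.stringify(buildGetRequest("whatthefuckever.com", "/blahblah.html")));
```

The two empty strings at the end of the array make the request finish with the blank line that tells the server the headers are over.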
Name:
Anonymous
2007-10-14 18:55
DO NOT HELP HIM!
Name:
Anonymous
2007-10-14 19:00
use LWP::Simple qw(get);
my $content = get("http://page.info");
die "couldn't fetch page" unless defined $content;  # get() returns undef on failure
# do stuff
Name:
Anonymous
2007-10-14 19:18
>>1
1. Fetch page with web browser.
2. YOU'RE DONE.
Why is /prog/ full of stupid questions lately?
Name:
Anonymous
2007-10-14 19:44
1. connect to server
2. GET
3. process response
4. ??????????
5. PROFIT
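Steps 1 to 3 above could look roughly like this in Node.js. This is a sketch under assumptions, not a robust client: fetchRaw and parseResponse are made-up names, it speaks plain HTTP on port 80 only, and it does not handle redirects or chunked bodies.

```javascript
const net = require("net");

// Steps 1 and 2: connect to the server and send the GET request.
// fetchRaw is a hypothetical helper; callback receives (err, rawResponse).
function fetchRaw(host, path, callback) {
  const socket = net.connect({ host: host, port: 80 }, () => {
    socket.write(`GET ${path} HTTP/1.0\r\nHost: ${host}\r\nConnection: close\r\n\r\n`);
  });
  const chunks = [];
  socket.on("data", (chunk) => chunks.push(chunk));
  socket.on("end", () => callback(null, Buffer.concat(chunks).toString("utf8")));
  socket.on("error", (err) => callback(err));
}

// Step 3: split the raw response into status code, headers and body.
function parseResponse(raw) {
  const split = raw.indexOf("\r\n\r\n"); // blank line separates headers from body
  const head = split === -1 ? raw : raw.slice(0, split);
  const body = split === -1 ? "" : raw.slice(split + 4);
  const lines = head.split("\r\n");
  const status = parseInt(lines[0].split(" ")[1], 10); // "HTTP/1.0 200 OK" -> 200
  const headers = {};
  for (const line of lines.slice(1)) {
    const i = line.indexOf(":");
    if (i > 0) {
      headers[line.slice(0, i).trim().toLowerCase()] = line.slice(i + 1).trim();
    }
  }
  return { status, headers, body };
}
```

Usage would be something like fetchRaw("example.com", "/", function (err, raw) { ... }), passing raw to parseResponse and then running whatever regex or HTML parser you like on the body field.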
Name:
Anonymous
2007-10-14 21:45
Name:
Anonymous
2007-10-15 10:51
Name:
Anonymous
2007-10-15 12:34
>>12
1.) It hasn't been implemented
2.) It's fucking retarded.
Name:
Anonymous
2011-11-10 11:28
old thread is old