grab html data without rss

Name: Anonymous 2007-10-14 15:42

Hey nerds. Hopefully some of you will help me.

I want to get dynamic HTML text off a webpage that doesn't have an RSS feed. Can this be done?

Name: Anonymous 2007-10-14 15:46

Yes it can, use your fucking brain.

Name: Anonymous 2007-10-14 17:05

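var req = new XMLHttpRequest();
// open(method, url, async): method goes first; false makes send() block until the response is in
req.open("GET", "page.html", false);
req.send(null);
var page_content = req.responseText;
// do some regex shit on page_content here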
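
For the regex step, a minimal sketch, assuming all you want is the text inside the page's <title> tag. extractTitle is a made-up name, and regex on HTML is fragile, so use a real parser for anything nested.

```javascript
// Hypothetical helper (not from the thread): pull the <title> text out of an HTML string.
function extractTitle(html) {
  // [\s\S]*? matches across newlines, non-greedy, up to the first closing tag
  var m = /<title[^>]*>([\s\S]*?)<\/title>/i.exec(html);
  return m ? m[1].trim() : null; // null when the page has no <title>
}
```

Call it as extractTitle(page_content) once the response is in.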

Name: Anonymous 2007-10-14 18:01

>>3
OP obviously wants to scrape shit from a remote website. XMLHttpRequest() = same-origin only.

Fuckwit.
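
Rough sketch of what the browser checks before it lets XMLHttpRequest read a response, assuming the remote server sends no headers that opt in to cross-origin access. sameOrigin is a made-up name; it compares scheme, host and port using the standard URL class.

```javascript
// Hypothetical helper (not from the thread): true when both URLs share scheme, host and port.
function sameOrigin(pageUrl, targetUrl) {
  // URL.origin is "scheme://host[:port]" with the default port left out
  return new URL(pageUrl).origin === new URL(targetUrl).origin;
}
```

If this comes back false for your page and the site you want to scrape, do the fetch server-side instead.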

Name: Anonymous 2007-10-14 18:10

The libwww-perl library is your friend.

Name: Anonymous 2007-10-14 18:32

Easy but retarded way:

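<?php
$fp=fsockopen("whatthefuckever.com",80);
if(!$fp){die("connect failed");} // fsockopen returns false on failure
fwrite($fp,"GET /blahblah.html\r\nHost: whatthefuckever.com\r\nConnection: Close\r\n\r\n");
$shit=""; // initialize before appending
while(!feof($fp)){$shit.=fread($fp,128);}
fclose($fp);
// do something with $shit
?>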

Name: Anonymous 2007-10-14 18:33

>>6
GET request string needs HTTP version in it. OP: Eat a dick.
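
A minimal sketch of a complete request string, assuming HTTP/1.1. buildGetRequest is a made-up name, and the host and path are whatever you pass in.

```javascript
// Hypothetical helper (not from the thread): builds a minimal HTTP/1.1 GET request.
function buildGetRequest(host, path) {
  return [
    "GET " + path + " HTTP/1.1", // request line: method, path, HTTP version
    "Host: " + host,             // required in HTTP/1.1
    "Connection: close",         // ask the server to close the socket when done
    "",                          // the empty line that ends the headers
    ""
  ].join("\r\n");
}
```

Write the returned string to the socket as-is.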

Name: Anonymous 2007-10-14 18:55

DO NOT HELP HIM!

Name: Anonymous 2007-10-14 19:00


use LWP::Simple qw(get);
my $content = get("http://page.info");
# do stuff

Name: Anonymous 2007-10-14 19:18

>>1
1. Fetch page with web browser.
2. YOU'RE DONE.

Why is /prog/ full of stupid questions lately?

Name: Anonymous 2007-10-14 19:44

1. connect to server
2. GET
3. process response
4. ??????????
5. PROFIT
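
Steps 1 to 3 can be sketched in Node roughly like this, assuming plain HTTP on port 80 and a server that honors "Connection: close". fetchRaw and parseResponse are made-up names, and chunked transfer encoding is not decoded here.

```javascript
const net = require("net");

// Step 3: split a raw HTTP response into status code, headers and body.
function parseResponse(raw) {
  const split = raw.indexOf("\r\n\r\n"); // blank line separates headers from body
  const head = split === -1 ? raw : raw.slice(0, split);
  const body = split === -1 ? "" : raw.slice(split + 4);
  const lines = head.split("\r\n");
  const status = parseInt(lines[0].split(" ")[1], 10); // "HTTP/1.1 200 OK" -> 200
  const headers = {};
  for (const line of lines.slice(1)) {
    const i = line.indexOf(":");
    if (i > 0) headers[line.slice(0, i).trim().toLowerCase()] = line.slice(i + 1).trim();
  }
  return { status, headers, body };
}

// Steps 1 and 2: connect, send the GET, collect data until the server closes.
function fetchRaw(host, path, callback) {
  const chunks = [];
  const socket = net.connect(80, host, () => {
    socket.write("GET " + path + " HTTP/1.1\r\nHost: " + host + "\r\nConnection: close\r\n\r\n");
  });
  socket.on("data", (chunk) => chunks.push(chunk));
  socket.on("end", () => callback(null, parseResponse(Buffer.concat(chunks).toString("utf8"))));
  socket.on("error", (err) => callback(err));
}
```

Usage would be something like fetchRaw(host, "/", function (err, res) { ... }), with res.body holding the page text.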

Name: Anonymous 2007-10-14 21:45

Name: Anonymous 2007-10-15 10:51

Name: Anonymous 2007-10-15 12:34

>>12
1.) It hasn't been implemented
2.) It's fucking retarded.

Name: Anonymous 2011-11-10 11:28

old thread is old
