/prog/ - grab html data without rss

Name: Anonymous 2007-10-14 15:42

hey nerds. Hopefully some of you will help me.

I want to get dynamic html text off a webpage that doesnt have an rss feed...can this be done?

Name: Anonymous 2007-10-14 15:46

Yes it can, use your fucking brain.

Name: Anonymous 2007-10-14 17:05

var req=new XMLHttpRequest();

req.open("page.html","get");

req.send();

var page_content=req.responseText;

//do some regex shit on page_content here

Name: Anonymous 2007-10-14 18:01

>>3
OP obviously wants to scrape shit from an remote website. XMLHttpRequest() = local only.

Fuckwit.

Name: Anonymous 2007-10-14 18:10

The libwww-perl library is your friend.

Name: Anonymous 2007-10-14 18:32

Easy but retarded way:



<?php

$fp=fsockopen("whatthefuckever.com",80);

fwrite($fp,"GET /blahblah.html\r\nHost: whatthefuckever.com\r\nConnection: Close\r\n\r\n");

while(!feof($fp)){$shit.=fread($fp,128);}

// do something with $shit

?>

Name: Anonymous 2007-10-14 18:33

>>6
GET request string needs HTTP version in it. OP: Eat a dick.

Name: Anonymous 2007-10-14 18:55

DO NOT HELP HIM!

Name: Anonymous 2007-10-14 19:00



use LWP::Simple qw(get);

my $content = get("http://page.info");

# do stuff

Name: Anonymous 2007-10-14 19:18

>>1
1. Fetch page with web browser.
2. YOU'RE DONE.

Why is /prog/ full of stupid questions lately?

Name: Anonymous 2007-10-14 19:44

1. connect to server
2. GET
3. process response
4. ??????????
5. PROFIT

Name: Anonymous 2007-10-14 21:45

>>4
http://wiki.mozilla.org/Cross_Site_XMLHttpRequest

Name: Anonymous 2007-10-15 10:51

You best be jokin' Nigga.

http://www.w3.org/Protocols/rfc2616/rfc2616.html

Name: Anonymous 2007-10-15 12:34

>>12
1.) It hasn't been implemented
2.) It's fucking retarded.

Name: Anonymous 2011-11-10 11:28

old thread is old

grab html data without rss

1 Name: Anonymous 2007-10-14 15:42

2 Name: Anonymous 2007-10-14 15:46

3 Name: Anonymous 2007-10-14 17:05

4 Name: Anonymous 2007-10-14 18:01

5 Name: Anonymous 2007-10-14 18:10

6 Name: Anonymous 2007-10-14 18:32

7 Name: Anonymous 2007-10-14 18:33

8 Name: Anonymous 2007-10-14 18:55

9 Name: Anonymous 2007-10-14 19:00

10 Name: Anonymous 2007-10-14 19:18

11 Name: Anonymous 2007-10-14 19:44

12 Name: Anonymous 2007-10-14 21:45

13 Name: Anonymous 2007-10-15 10:51

14 Name: Anonymous 2007-10-15 12:34

15 Name: Anonymous 2011-11-10 11:28