I have a set of html pages all of which have the link to the table of contents at the same position (relatively to the beginning of the file). I want to remove these links, how do I do it with bash?
>>9
He was lying. He's got 99 other problems, and now (thanks to >>7 ) a complete lack of knowledge regarding regular expressions isn't one.
Name:
Anonymous2008-09-14 13:13
>>1
First you need to convert the html to XHTML 1.1 Strict. Then you can use the ENTERPRISE XML PARSER Xerces 1.4.4 to manipulate the DOM model and remove the table of contents elements.
OP here. I have solved the problem. There is no need to reply to this thread anymore.
Name:
Anonymous2008-09-14 17:15
Well yeah, I solved the problem as >>16 predicted, but I have one question: how would you tell sed to delete every line that starts with a specified pattern?
Name:
Anonymous2008-09-14 17:38
sed.tell("delete every line that starts with a specified pattern");
┌┴┴┴┴┴┴┴┴┴┴┴┴┴┴┴┐
┤This is an x86 ├
┤microchip!! ├
┤Copy it to help├
┤it proliferate.├
└┬┬┬┬┬┬┬┬┬┬┬┬┬┬┬┘
Name:
Anonymous2008-09-15 15:07
>>17
Do it the Java way: Sed.getInstance().sendMessage(MessageBuilder.getInstance().buildMessage(new BufferedPerformer(new Action('delete', CriterionCouplerFactory.coupleCriterions(new Criterion('line', new CriterionMarker('every')), new Criterion('starts', new CriterionMarker(new Pattern('pattern'))))))))
Name:
Anonymous2008-09-15 15:23
>>23 ... new CriterionMarker(new Pattern('pattern'))))))))
Waiter! There's a cudder in my java!