HTML Utilities


html-xml-utils-*.tar.gz
    A number of simple utilities for manipulating HTML and XML files.
    See INSTALL for generic installation instructions. Get the source at:
    http://www.w3.org/Tools/HTML-XML-utils/

htmlutils-*.tar.gz
    Old versions (before version 0.1)

Note: the names changed in version 5.0: most programs got an "hx" prefix. Please uninstall any version < 5.0 before installing a version >= 5.0.

cexport (1) - create header file of exported declarations from a C file
hxaddid (1) - add IDs to selected elements
hxcite (1) - replace bibliographic references by hyperlinks
hxcite-mkbib (1) - expand references and create bibliography
hxcopy (1) - copy an HTML file while preserving relative links
hxcount (1) - count elements and attributes in HTML or XML files
hxextract (1) - extract selected elements
hxclean (1) - apply heuristics to correct an HTML file
hxprune (1) - remove marked elements from an HTML file
hxincl (1) - expand included HTML or XML files
hxindex (1) - create an alphabetically sorted index
hxmkbib (1) - create bibliography from a template
hxmultitoc (1) - create a table of contents for a set of HTML files
hxname2id - move some ID= or NAME= from A elements to their parents
hxnormalize (1) - pretty-print an HTML file
hxnum (1) - number section headings in an HTML file
hxpipe (1) - convert XML to a format easier to parse with Perl or AWK
hxprintlinks (1) - number links & add table of URLs at end of an HTML file
hxremove (1) - remove selected elements from an XML file
hxtabletrans (1) - transpose an HTML or XHTML table
hxtoc (1) - insert a table of contents in an HTML file
hxuncdata (1) - replace CDATA sections by character entities
hxunent (1) - replace HTML predefined character entities by UTF-8
hxunpipe (1) - convert output of hxpipe back to XML format
hxunxmlns (1) - replace "global names" by XML Namespace prefixes
hxwls (1) - list links in an HTML file
hxxmlns (1) - replace XML Namespace prefixes by "global names"
asc2xml, xml2asc (1) - convert between UTF-8 and &#nnn; entities
hxref (1) - generate cross-references
hxselect (1) - extract elements that match a (CSS) selector
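The programs listed above read and write plain streams, so they can be combined in a pipeline. The following is a minimal sketch, not part of the package: it assumes html-xml-utils is installed and in PATH, and the sample file and its contents are made up for illustration. It normalizes a small HTML file to well-formed XHTML and then extracts the content of one element with a CSS selector.

```shell
#!/bin/sh
# Sketch only: the sample file and its contents are hypothetical.
sample=$(mktemp)
cat > "$sample" <<'EOF'
<html><head><title>Demo page</title></head>
<body><h1>Heading</h1><p>See <a href="http://www.w3.org/">W3C</a>.</p></body></html>
EOF

if command -v hxnormalize >/dev/null 2>&1 && command -v hxselect >/dev/null 2>&1; then
    # hxselect expects well-formed XML, so convert to XHTML first (-x),
    # then print only the content (-c) of the elements matching the selector.
    hxnormalize -x "$sample" | hxselect -c 'title'
    echo
    status=ran
else
    # Assumption not met: the utilities are not installed on this machine.
    echo "html-xml-utils not found in PATH; skipping the pipeline" >&2
    status=skipped
fi
```

If the utilities are missing, the sketch only prints a notice; it does not attempt to install anything.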

This package is configured with automake/autoconf. Generic instructions are in the file INSTALL. Here are some specific problems that may arise:

1) Error when running lex:

     lex scan.l && mv lex.yy.c scan.c
     "scan.l":line 2: Error: missing translation value

The scan.l file uses features of flex that do not exist in lex. However, it is not necessary to run lex, since the generated file scan.c is provided in the package. Just run "touch scan.c" so that scan.c is newer than scan.l and "make" will not try to regenerate it.
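Why the workaround helps can be shown with a small simulation. This is a sketch under stated assumptions, not the package's build logic: it uses empty stand-in files in a scratch directory and fixed, made-up timestamps, and it mimics the comparison "make" performs (is the target newer than its source?) with the shell's -nt test.

```shell
#!/bin/sh
# Sketch only: empty stand-in files, not the real package sources.
set -e
workdir=$(mktemp -d)

# Simulate the problem: scan.c is older than scan.l,
# so "make" would try to regenerate scan.c by running lex.
: > "$workdir/scan.c"
: > "$workdir/scan.l"
touch -t 201401010000 "$workdir/scan.c"
touch -t 201401151503 "$workdir/scan.l"

if [ "$workdir/scan.c" -nt "$workdir/scan.l" ]; then before=up-to-date; else before=stale; fi

# The workaround from the README: refresh the timestamp of scan.c.
touch "$workdir/scan.c"

if [ "$workdir/scan.c" -nt "$workdir/scan.l" ]; then after=up-to-date; else after=stale; fi

echo "before touch: $before"
echo "after touch:  $after"

rm -rf "$workdir"
```

In the real source tree only the "touch scan.c" step is needed, followed by "make" as usual.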

$Date: 2014-01-15 15:03:17 $

http://www.w3.org/Tools/HTML-XML-utils/README
