rxp(1)


NAME

   rxp - XML parser program

SYNOPSIS

   rxp  [  -abemnNRsStvVx4  ]  [  -o  b|p|0|1|2|3|i|d  ]  [ U 0|1|2 ] [ -c
   encoding ] [ url ]

DESCRIPTION

   rxp reads and parses XML from the url (or standard  input  if  none  is
   provided)  and  writes  it  to  standard  output,  optionally expanding
   entities, defaulting attributes, and translating to a different  output
   encoding.

   rxp  accepts  XML  1.0  and  1.1, and the corresponding versions of XML
   namespaces.  It implements the Oasis XML catalog specification.

   Common option combinations are -Nxs  to  check  a  document  for  well-
   formedness  and  namespace well-formedness, and -VNxs to also check for
   DTD-validity.

OPTIONS

   -a     Insert declared default values for omitted attributes.

   -v     Be verbose.

   -V     Validate the document.  Repeating  this  option  will  make  the
          program  treat  validity  errors  as well-formedness errors, and
          exit after the first validity error (otherwise a warning will be
          printed for each one).

   -d     Read  the  whole DTD (internal and external parts) regardless of
          any   standalone   declaration.    Otherwise    a    declaration
          "standalone='yes'"  will  prevent  the  external part from being
          read (unless validation is selected).

   -N     Enable XML namespace support.  The document will be checked  for
          correct  namespace  syntax,  and  if  -b is specified  qualified
          element and attribute names will be displayed with their URIs.

   -R     The value of this flag is a time limit in seconds,  after  which
          the  program  will abort.  This is to protect against denial-of-
          service attacks using malicious documents.

   -S     Keep track of  xml:space  attributes.   This  will  only  affect
          output when -b is specified.

   -e     Obsolete, do not use.

   -E     Do not expand entity references (opposite of old -e flag)

   -s     Be  silent  (that is, suppress output).  Useful for benchmarking
          or if you just want to see the error messages.

   -b     Print output as "bits".

   -n     Treat the  input  as  normalised  SGML  rather  than  XML.   Not
          intended for general use.

   -o     If  this  flag is p, output is in the default (plain) format. If
          it is b, output is printed as "bits" (equivalent to  -b).     If
          it is 0, output is suppressed (equivalent to -s).  If it is 1, 2
          or 3, output is in first, second or third canonical form.  If it
          is  i,  output is a dump of the document's infoset.  If it is d,
          output is in a form suitable for use with "diff"; in  particular
          attributes are sorted into alphabetical order.

   -m     Merge  PCData  across  entity references.  This will only affect
          the output when -b is specified.

   -t     Read in the input as a tree, rather than bits.  Should  make  no
          difference to the output.

   -u base_uri
          Use the specified base URI when resolving system identifiers.

   -U     This  flag  controls  Unicode normalization checking and is only
          relevant when parsing  XML  1.1  documents.   If  it  is  0,  no
          checking  is  done.  If it is 1, rxp checks that the document is
          fully normalized as defined by the W3C character model.   If  it
          is  2, the document is checked and any unknown characters (which
          may be ones corresponding to a newer version of Unicode than rxp
          knows about) will also cause an error.

   -x     Strict  XML  mode.   This  suppresses  some  warnings (eg entity
          redefinitions) but treats  all  XML  well-formedness  errors  as
          fatal.   This  flag  implies  the  -a  flag, and sets the output
          encoding to UTF-8 unless the -c flag  is  given.   It  sets  the
          output  format  to  first canonical form unless the -o, -b or -s
          flag is given.

   -c encoding
          Produce output  in  the  specified  character  encoding.   Known
          encodings  include  ISO-8859-1, UTF-8, ISO-10646-UCS and UTF-16.
          16-bit encoding names my be suffixed with -B or  -L  to  specify
          big-  or  little-endian byte order (the default is the host byte
          order).  If no -c or -x option is given, output is in  the  same
          encoding as the input document.

   -D name sysid
          Force  use  of  the  document type specified by sysid.  The root
          element name for validation is name.  Any DTD in the document is
          ignored.   This  flag  does  not  imply  validation;  use  -V if
          required.

   -i     Do xml:id processing.  Attributes named xml:id are recognised as
          IDs even if not declared.

   -I     The  same  as  -i, but in addition xml:id attributes are checked
          for uniqueness.

   -z     Use a shorter format for error  messages.   Particularly  useful
          when  using  the parser in Emacs compilation mode, so that Emacs
          can find the error location.

   -4     Use pre-fifth-edition rules for XML 1.0.  XML 1.0 fifth  edition
          extends the set of allowed name characters to match XML 1.1, and
          allows unrecognised version  numbers  of  the  form  1.x  to  be
          treated as 1.0.  the -4 flag disables these changes.

EXIT STATUS

   If the -V flag is given, and the document is well-formed but not valid,
   2 is returned.  If the document is not well-formed, or a  system  error
   occurs,  1 is returned.  Otherwise 0 is returned.  Since the parser can
   expand external entities even when not validating,  it  treats  certain
   errors which are technically validity errors as well-formedness errors.
   If -x is  not  specified,  some  well-formedness  errors  produce  only
   warnings and do not affect the exit status.

ENVIRONMENT

   If  the  environment  variable  XML_CATALOG_FILES  is  set, XML catalog
   processing is enabled.  A catalog can be used to map system and  public
   identifiers  to  local  files.   In  particular,  this allows copies of
   common DTDs to be kept locally, so that rxp does not have to fetch them
   over  the  internet.   XML_CATALOG_FILES  should  be  set  to  a space-
   separated list of catalog files.  The variable  XML_CATALOG_PREFER  may
   be  set  to  public  or  system  to  set  the  initial mode for catalog
   processing; the default is system.

   If the variable RXPURL is set, it is used as the URL of the document to
   parse.   This  may be useful in CGI scripts and the like to avoid shell
   parsing of a user-supplied argument.

   The variable http_proxy can  be  used  to  specify  a  proxy  for  HTTP
   connections.  The syntax is hostname[:port].

                           RXP release 1.4.0                        RXP(1)





Opportunity


Personal Opportunity - Free software gives you access to billions of dollars of software at no cost. Use this software for your business, personal use or to develop a profitable skill. Access to source code provides access to a level of capabilities/information that companies protect though copyrights. Open source is a core component of the Internet and it is available to you. Leverage the billions of dollars in resources and capabilities to build a career, establish a business or change the world. The potential is endless for those who understand the opportunity.

Business Opportunity - Goldman Sachs, IBM and countless large corporations are leveraging open source to reduce costs, develop products and increase their bottom lines. Learn what these companies know about open source and how open source can give you the advantage.





Free Software


Free Software provides computer programs and capabilities at no cost but more importantly, it provides the freedom to run, edit, contribute to, and share the software. The importance of free software is a matter of access, not price. Software at no cost is a benefit but ownership rights to the software and source code is far more significant.


Free Office Software - The Libre Office suite provides top desktop productivity tools for free. This includes, a word processor, spreadsheet, presentation engine, drawing and flowcharting, database and math applications. Libre Office is available for Linux or Windows.





Free Books


The Free Books Library is a collection of thousands of the most popular public domain books in an online readable format. The collection includes great classical literature and more recent works where the U.S. copyright has expired. These books are yours to read and use without restrictions.


Source Code - Want to change a program or know how it works? Open Source provides the source code for its programs so that anyone can use, modify or learn how to write those programs themselves. Visit the GNU source code repositories to download the source.





Education


Study at Harvard, Stanford or MIT - Open edX provides free online courses from Harvard, MIT, Columbia, UC Berkeley and other top Universities. Hundreds of courses for almost all major subjects and course levels. Open edx also offers some paid courses and selected certifications.


Linux Manual Pages - A man or manual page is a form of software documentation found on Linux/Unix operating systems. Topics covered include computer programs (including library and system calls), formal standards and conventions, and even abstract concepts.