fastqc(1)


NAME

   FastQC - high throughput sequence QC analysis tool

   SYNOPSIS

          fastqc seqfile1 seqfile2 .. seqfileN

          fastqc [-o output dir] [--(no)extract] [-f fastq|bam|sam]

          [-c contaminant file] seqfile1 .. seqfileN

DESCRIPTION

          FastQC  reads a set of sequence files and produces from each one
          a quality control report consisting of  a  number  of  different
          modules,  each  one  of  which will help to identify a different
          potential type of problem in your data.

          If no files to process are specified on the  command  line  then
          the  program will start as an interactive graphical application.
          If files are provided on the command line then the program  will
          run  with  no  user  interaction  required.   In this mode it is
          suitable for inclusion into a standardised analysis pipeline.

          The options for the program as as follows:

   -h --help
          Print this help file and exit

   -v --version
          Print the version of the program and exit

   -o --outdir
          Create all output  files  in  the  specified  output  directory.
          Please  note  that this directory must exist as the program will
          not create it.  If this option is not set then the  output  file
          for  each  sequence file is created in the same directory as the
          sequence file which was processed.

   --casava
          Files come from raw casava output.  Files  in  the  same  sample
          group (differing only by the group number) will be analysed as a
          set rather than individually. Sequences with the filter flag set
          in  the  header  will be excluded from the analysis.  Files must
          have the same names given to them  by  casava  (including  being
          gzipped  and  ending  with  .gz) otherwise they won't be grouped
          together correctly.

   --extract
          If set then the zipped output file will be uncompressed  in  the
          same  directory  after  it  has  been  created.  By default this
          option will be set if fastqc is run in non-interactive mode.

   -j --java
          Provides the full path to the java binary you  want  to  use  to
          launch  fastqc.  If  not  supplied then java is assumed to be in
          your path.

   --noextract
          Do not uncompress the output file after creating it.  You should
          set this option if you do not wish to uncompress the output when
          running in non-interactive mode.

   --nogroup
          Disable grouping of bases for reads >50bp. All reports will show
          data  for  every  base  in the read.  WARNING: Using this option
          will cause fastqc to crash and burn if you use it on really long
          reads,  and  your  plots may end up a ridiculous size.  You have
          been warned!

   -f --format
          Bypasses the normal sequence file format  detection  and  forces
          the  program  to  use  the  specified format.  Valid formats are
          bam,sam,bam_mapped,sam_mapped and fastq

   -t --threads
          Specifies  the  number  of  files   which   can   be   processed
          simultaneously.   Each  thread will be allocated 250MB of memory
          so you shouldn't run more threads  than  your  available  memory
          will cope with, and not more than 6 threads on a 32 bit machine

   -c     Specifies a non-default file which contains the list of

   --contaminants
          contaminants  to  screen overrepresented sequences against.  The
          file must  contain  sets  of  named  contaminants  in  the  form
          name[tab]sequence.  Lines prefixed with a hash will be ignored.

   -k --kmers
          Specifies  the  length  of  Kmer to look for in the Kmer content
          module. Specified Kmer length must be between 2 and 10.  Default
          length is 5 if not specified.

   -q --quiet
          Suppress all progress messages on stdout and only report errors.

BUGS

          Any    bugs   in   fastqc   should   be   reported   either   to
          simon.andrews@babraham.ac.uk                or                in
          www.bioinformatics.babraham.ac.uk/bugzilla/

AUTHOR

          This  manpage  was  created  using  help2man  by  Andreas  Tille
          <tille@debian.org> for the Debian distribution but can  be  used
          by others as well.





Opportunity


Personal Opportunity - Free software gives you access to billions of dollars of software at no cost. Use this software for your business, personal use or to develop a profitable skill. Access to source code provides access to a level of capabilities/information that companies protect though copyrights. Open source is a core component of the Internet and it is available to you. Leverage the billions of dollars in resources and capabilities to build a career, establish a business or change the world. The potential is endless for those who understand the opportunity.

Business Opportunity - Goldman Sachs, IBM and countless large corporations are leveraging open source to reduce costs, develop products and increase their bottom lines. Learn what these companies know about open source and how open source can give you the advantage.





Free Software


Free Software provides computer programs and capabilities at no cost but more importantly, it provides the freedom to run, edit, contribute to, and share the software. The importance of free software is a matter of access, not price. Software at no cost is a benefit but ownership rights to the software and source code is far more significant.


Free Office Software - The Libre Office suite provides top desktop productivity tools for free. This includes, a word processor, spreadsheet, presentation engine, drawing and flowcharting, database and math applications. Libre Office is available for Linux or Windows.





Free Books


The Free Books Library is a collection of thousands of the most popular public domain books in an online readable format. The collection includes great classical literature and more recent works where the U.S. copyright has expired. These books are yours to read and use without restrictions.


Source Code - Want to change a program or know how it works? Open Source provides the source code for its programs so that anyone can use, modify or learn how to write those programs themselves. Visit the GNU source code repositories to download the source.





Education


Study at Harvard, Stanford or MIT - Open edX provides free online courses from Harvard, MIT, Columbia, UC Berkeley and other top Universities. Hundreds of courses for almost all major subjects and course levels. Open edx also offers some paid courses and selected certifications.


Linux Manual Pages - A man or manual page is a form of software documentation found on Linux/Unix operating systems. Topics covered include computer programs (including library and system calls), formal standards and conventions, and even abstract concepts.