HTSeqCountCluster package

Subpackages

Submodules

HTSeqCountCluster.htseq_count_cluster module

HTSeqCountCluster.htseq_count_cluster.call_htseq(infile, gtf, outfile)[source]

Call the htseq-count script.

Parameters:
  • infile (str) – An alignment file of aligned reads in SAM format.

  • gtf (str) – The gtf (Gene transfer format) file.

  • outfile (str) – The name of the output SAM alignment file.

HTSeqCountCluster.htseq_count_cluster.check_job_status(job_id, email=True)[source]

Use Qstat to monitor your job status.

Parameters:
  • job_id (str) – The job’s id.

  • email (bool, optional) – A flag to decide whether to send email, defaults to True

HTSeqCountCluster.htseq_count_cluster.htseq_jobber(input_path, inputlist, gtf, outpath, email)[source]

Create multiple pbs jobs based on input list of files.

Parameters:
  • input_path ([type]) – [description]

  • inputlist ([type]) – [description]

  • gtf (str) – The gtf (Gene transfer format) file.

  • outpath ([type]) – [description]

  • email (str) – An email address to send notifications.

HTSeqCountCluster.htseq_count_cluster.main()[source]

Main CLI entry point with subcommands for run and merge.

HTSeqCountCluster.mergecounts module

HTSeqCountCluster.mergecounts.main()[source]

Run the merge_counts_tables function.

HTSeqCountCluster.mergecounts.merge_counts_tables(files_dir)[source]

Merge multiple counts tables into 1 counts table.

After running htseq-count-cluster, there will be a counts table for each sample in the output directory. This function will use the genes column as the first column and then insert each subsequent sample name as column header with counts data as the column rows.

Parameters:

files_dir (str) – The directory of the individual counts files.

Module contents