Added support for Top Alleles, added more report options by mjung2019 · Pull Request #18 · Illumina/BeadArrayFiles

mjung2019 · 2019-06-13T00:42:37Z

Added support for Top Alleles.
Added more options to the gtc_final_report.py example.
Added a new example script, gtc_final_report_matrix.py, which creates matrix reports for the selected allele type, optionally with genotype scores. The format is identical to the matrix export option in GenomeStudio.


KelleyRyanM

See comments

KelleyRyanM · 2019-06-13T15:15:05Z

module/BeadPoolManifest.py

+    Unknown = 0
+    TOP = 1
+    BOT = 2
+    PLUS = 1


These should have different values; otherwise the to_string behavior below will be undesirable. If necessary, get_base_calls_generic could be generalized to take a list of report strands (instead of a single value).

Changed attribute assignments, can now be handled by get_base_calls_generic, see below.
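A minimal sketch of why the shared value matters, assuming a plain constants class with a `to_string` helper. The numeric values, the `MINUS` member, and the returned strings are assumptions for illustration, not the values committed in this PR.

```python
class IlmnStrand(object):
    """Illustrative strand constants; every member has a distinct value."""
    Unknown = 0
    TOP = 1
    BOT = 2
    PLUS = 3   # assumed value; the point is only that it differs from TOP
    MINUS = 4  # assumed member, added for symmetry

    @staticmethod
    def to_string(ilmn_strand):
        """Map a constant back to a name.

        If TOP and PLUS shared the value 1, the dict below would hold a
        single entry for both, and one of the two names would be lost.
        """
        names = {
            IlmnStrand.Unknown: "U",
            IlmnStrand.TOP: "TOP",
            IlmnStrand.BOT: "BOT",
            IlmnStrand.PLUS: "PLUS",
            IlmnStrand.MINUS: "MINUS",
        }
        if ilmn_strand not in names:
            raise ValueError("Unexpected value for Illumina strand: %r" % (ilmn_strand,))
        return names[ilmn_strand]
```

With distinct values, `IlmnStrand.to_string(IlmnStrand.TOP)` and `IlmnStrand.to_string(IlmnStrand.PLUS)` can no longer collide.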

KelleyRyanM · 2019-06-13T15:15:41Z

module/BeadPoolManifest.py

+        Raises:
+            ValueError: Unexpected value for Illumina strand
+        """
+        if ilmn_strand == "U" or ilmn_strand == "":


Shouldn't empty string also raise a ValueError?

Right, empty string is now an exception.

Question though:
for class RefStrand it states:
if ref_strand == "U" or ref_strand == "":
return RefStrand.Unknown

Is that the desired behavior for empty string?
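For comparison, a sketch of the stricter parsing agreed on for the Illumina strand, where only "U" maps to unknown and an empty string is rejected. The function name, the integer constants, and the string codes other than "U" are hypothetical; whether RefStrand should follow the same rule is the open question above.

```python
# Hypothetical constants standing in for the strand class attributes.
UNKNOWN, TOP, BOT, PLUS, MINUS = range(5)

# Assumed string codes; only "U" is taken from the diff under review.
_STRAND_BY_CODE = {"U": UNKNOWN, "TOP": TOP, "BOT": BOT, "PLUS": PLUS, "MINUS": MINUS}


def ilmn_strand_from_string(ilmn_strand):
    """Return the strand constant for a code, raising ValueError otherwise.

    The empty string has no entry in the table, so it raises like any
    other unexpected value instead of silently mapping to UNKNOWN.
    """
    try:
        return _STRAND_BY_CODE[ilmn_strand]
    except KeyError:
        raise ValueError("Unexpected value for Illumina strand: %r" % (ilmn_strand,))
```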

KelleyRyanM · 2019-06-13T15:22:49Z

module/GenotypeCalls.py

+            The genotype basecalls on the report strand as a list of strings.
+            The characters are A, C, G, T, or - for a no-call/null.
+        """
+        return self.get_base_calls_generic(snps, ilmn_strand_annotations, IlmnStrand.TOP, RefStrand.Unknown)


e.g., here could potentially pass
return self.get_base_calls_generic(snps, ilmn_strand_annotations, [IlmnStrand.TOP, IlmnStrand.PLUS], IlmnStrand.Unknown)

Also, this should pass IlmnStrand.Unknown instead of RefStrand.Unknown. (get_base_calls_forward_strand also passes RefStrand.Unknown, but that is probably not the correct behavior there either.)

Added list support for get_base_calls_generic. That should resolve the ambiguity. The list type is optional so that the other methods stay untouched. Inside the method, a non-list argument is converted to a list once, before iterating over the alleles, because doing list-type checks in the inner loop could be a performance drain compared to looping over a single element.

Changed to IlmnStrand.Unknown.
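The normalization described in this reply can be sketched as follows. This is not the code from get_base_calls_generic; `as_strand_list` and `count_matching` are hypothetical helpers that only illustrate accepting a single value or a list and doing the type check once, outside the per-SNP loop.

```python
def as_strand_list(report_strand):
    """Wrap a single strand value in a list; copy lists and tuples as-is."""
    if isinstance(report_strand, (list, tuple)):
        return list(report_strand)
    return [report_strand]


def count_matching(strand_annotations, report_strand):
    """Count annotations that match any of the accepted report strands."""
    # Normalize once, before the loop, so the loop body has no type checks.
    accepted = frozenset(as_strand_list(report_strand))
    return sum(1 for strand in strand_annotations if strand in accepted)
```

Callers that pass a single strand keep working unchanged, while a caller such as the Top-allele report can pass several strands in one call.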

KelleyRyanM · 2019-06-13T15:27:46Z

examples/gtc_final_report.py


 try:
-    manifest = BeadPoolManifest(args.manifest)
+    manifest = BeadPoolManifest.BeadPoolManifest(args.manifest)


Does the reference to this import need to change?

The reference should be fine; I checked by running gtc_final_report_matrix.py. There is some ambiguity because the module name and the class name are the same.
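The same module-versus-class naming ambiguity exists in the standard library, which makes for a quick runnable illustration. This uses `datetime` as a stand-in and is only an analogy for BeadPoolManifest.BeadPoolManifest, not code from this repository.

```python
import datetime                      # binds the module named datetime
from datetime import datetime as dt  # binds the class inside that module

# Through the module, the class needs the doubled name, much like
# BeadPoolManifest.BeadPoolManifest(args.manifest) in the diff above.
via_module = datetime.datetime(2019, 6, 13)

# Through the direct class import, a single name is enough.
via_class = dt(2019, 6, 13)

same = via_module == via_class
```

Which spelling is correct at the call site depends only on which of the two import forms the script uses.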

KelleyRyanM · 2019-06-13T15:28:27Z

examples/gtc_final_report_matrix.py

+
+args = parser.parse_args()
+
+if len(sys.argv) != NUM_ARGUMENTS:


The argument parser should be able to handle this error?

In this case yes as only one matrix report is supported per run, also didn't make sense to me to add a default case.

KelleyRyanM · 2019-06-13T15:28:53Z

examples/gtc_final_report_matrix.py

+parser.add_argument("manifest", help="BPM manifest file")
+parser.add_argument("gtc_directory", help="Directory containing GTC files")
+parser.add_argument("output_file", help="Location to write report")
+parser.add_argument("--forward", help="python gtc_final_report_matrix.py <path_to_manifest> <path_to_gtc_directory> <path_to_output_file> --forward 1, print matrix with forward alleles")


Should there be an enumerated format option here?

I don't see any harm in spelling out the arguments explicitly in the parser, as long as we are not expecting many more cases. If you like, I can use an enumeration for the options, but I don't think the gain in readability would be that big.
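For reference, one way an enumerated option could look with argparse's `choices`. The `--report-type` flag and its values are hypothetical and do not appear in this PR; making the option required reflects the point above that one report type is produced per run and no default is wanted.

```python
import argparse


def build_parser():
    """Build a parser with a single enumerated report option (illustrative)."""
    parser = argparse.ArgumentParser(description="Sketch of an enumerated report option")
    parser.add_argument("manifest", help="BPM manifest file")
    parser.add_argument(
        "--report-type",
        choices=["forward", "top"],  # hypothetical values
        required=True,               # no default: exactly one report per run
        help="Allele type to write in the matrix report",
    )
    return parser


args = build_parser().parse_args(["manifest.bpm", "--report-type", "top"])
```

argparse rejects any value outside `choices` and lists the allowed values in the usage message, so the parser itself reports a bad or missing option.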

jjzieve · 2019-06-13T20:09:11Z

examples/gtc_final_report_matrix.py

+parser.add_argument("manifest", help="BPM manifest file")
+parser.add_argument("gtc_directory", help="Directory containing GTC files")
+parser.add_argument("output_file", help="Location to write report")
+parser.add_argument("--forward", help="python gtc_final_report_matrix.py <path_to_manifest> <path_to_gtc_directory> <path_to_output_file> --forward 1, print matrix with forward alleles")


Is there a reason you have to pass a "1" instead of using something like action='store_true' which would make something like args.forward eval to True? Also, why does the "help" have the entire command written, wouldn't that be redundant?

Not really. Passing action="store_true", default=False to the argument works fine. OK, I can make this change.

As for writing out the entire command: normally I wouldn't bother, but I have had so many requests about how to run commands from people who are not familiar with the console that it doesn't hurt to be extra obvious.

Ok, added the changes with the latest commit
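A minimal sketch of the flag as discussed, using `action="store_true"`. The positional arguments are copied from the diff; the description text, the shortened help string, and the sample file names are assumptions, and the committed script may differ.

```python
import argparse


def build_parser():
    """Build the matrix report parser with --forward as a boolean flag."""
    parser = argparse.ArgumentParser(description="Build a matrix report from GTC files")
    parser.add_argument("manifest", help="BPM manifest file")
    parser.add_argument("gtc_directory", help="Directory containing GTC files")
    parser.add_argument("output_file", help="Location to write report")
    # store_true: the flag takes no value; args.forward is True when present.
    parser.add_argument("--forward", action="store_true", default=False,
                        help="Print matrix with forward alleles")
    return parser


# Passing an explicit list keeps the example independent of sys.argv.
args = build_parser().parse_args(["manifest.bpm", "gtc_dir", "report.txt", "--forward"])
```

Because argparse already reports missing positional arguments, a separate check on `len(sys.argv)` is not needed with this setup.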


Added support for Top Alleles, added more report options for the gtc_…

42b585f

…final_report script and added a new example script to support building matrix reports`x

KelleyRyanM suggested changes Jun 13, 2019


mjung2019 added 2 commits June 13, 2019 10:06

Added modifications requested by Ryan

515ac5c

minor text changes

844a5c9

jjzieve reviewed Jun 13, 2019


mjung2019 added 2 commits June 13, 2019 13:50

parser update, removed duplication bug

37251d8

another report example

af03dda

d-wdj added a commit to d-wdj/BeadArrayFiles that referenced this pull request Aug 25, 2022

Added mods to BeadPoolManifest.py as per Illumina#18 which never got …

de31607

…into an official release

d-wdj added a commit to d-wdj/BeadArrayFiles that referenced this pull request Aug 25, 2022

Added mods to BeadPoolManifest.py as per Illumina#18 which never got …

44ddf14

…into an official release



3 participants