
GITHUB . COM {
}
Detected CMS Systems:
- Wordpress (2 occurrences)
Title:
Unexpected trimming behavior Β· Issue #784 Β· marcelm/cutadapt
Description:
Hey! I am playing around with cutadapt to figure out if I can use it for our use case. I ran into the following issue: My target sequence is AAAAAAAAAA and I ran cutadapt agains the sequences shown below that are of the same length but w...
Website Age:
17 years and 8 months (reg. 2007-10-09).
Matching Content Categories {π}
- Video & Online Content
- Real Estate
- Mobile Technology & AI
Content Management System {π}
What CMS is github.com built with?
Github.com uses WORDPRESS.
Traffic Estimate {π}
What is the average monthly size of github.com audience?
ππ Tremendous Traffic: 10M - 20M visitors per month
Based on our best estimate, this website will receive around 10,653,877 visitors per month in the current month.
check SE Ranking
check Ahrefs
check Similarweb
check Ubersuggest
check Semrush
How Does Github.com Make Money? {πΈ}
Subscription Packages {π³}
We've located a dedicated page on github.com that might include details about subscription plans or recurring payments. We identified it based on the word pricing in one of its internal links. Below, you'll find additional estimates for its monthly recurring revenues.How Much Does Github.com Make? {π°}
Subscription Packages {π³}
Prices on github.com are in US Dollars ($).
They range from $4.00/month to $21.00/month.
We estimate that the site has approximately 5,316,156 paying customers.
The estimated monthly recurring revenue (MRR) is $22,327,854.
The estimated annual recurring revenues (ARR) are $267,934,248.
Wordpress Themes and Plugins {π¨}
What WordPress theme does this site use?
It is strange but we were not able to detect any theme on the page.
What WordPress plugins does this website use?
It is strange but we were not able to detect any plugins on the page.
Keywords {π}
aaa, readindel, cutadapt, barcode, adapter, readsnp, marcelm, score, error, gtcaaa, aatgtcaaa, adrianomartinelli, end, inputfastq, alignment, matrix, trimming, issue, start, files, commented, option, read, sign, sequence, readsnpend, output, trim, run, rename, tgtcaaa, edit, debug, unexpected, behavior, aaaaaaaaaa, infofileinfotsv, readsnpstart, infotsv, aaaaaaaaa, owner, anchored, indels, readnoerror, aatagtcaaa, tagtcaaa, errors, initialized, distances, scores,
Topics {βοΈ}
unexpected trimming behavior tsv output read-snp-start {barcode_1}{const_region_1}{barcode_2}{const_region_2}{barcode_3} expected read-snp-start fastq \ --info-file=info ~~~~~~~~~~ read-snp-end read-snp-end ~~~~~~~ read-indel-0--1 ~~~~ ~~~~~~ read-indel-1--1 ~~~~~~~ read-indel-2--1 ~~~~~~~~ read-indel-3--1 read-indel-2--1 read-indel-0--1 read-indel-1--1 read-indel-3--1 read-snp-0--1 ~~~~~~ read-snp-1--1 ~~~~~~ read-snp-2--1 read-snp-2--1 read-snp-1--1 comment metadata assignees taagtcaaa aaaaaaaaat read marcelm closed insights tsv read dynamic programming matrix info file read header ran cutadapt agains command-line parameters adapter trimming indel /dev/null input initialized incorrectly snp tsv file read fastq header info files quick fix fastq input anchored adapter option {adapter_name} 0 error rate issue aatgtcaaa aatgtcaaa initialized correctly f7bf154 sign
Payment Methods {π}
- Braintree
Questions {β}
- Already have an account?
- However, substitutions at the end are treated differently from substitutions at the start?
- It seems like indels at the start are not punished or counted as errors?
Schema {πΊοΈ}
DiscussionForumPosting:
context:https://schema.org
headline:Unexpected trimming behavior
articleBody:Hey!
I am playing around with cutadapt to figure out if I can use it for our use case. I ran into the following issue:
My target sequence is `AAAAAAAAAA` and I ran cutadapt agains the sequences shown below that are of the same length but with a substitution either at the start or the end of the sequence. I would have expected that with an overlap of 9 and 0 error rate both reads in the `input.fastq` are identified. However, substitutions at the end are treated differently from substitutions at the start?
- Cutadapt and Python version: cutadapt 4.8 and Python 3.11.8
- How you installed the tool (conda or pip, for example): pip installed
- Which command-line parameters you used:
```bash
cutadapt -e 0 -O 9 \
-g AAAAAAAAAA \
-o /dev/null input.fastq \
--info-file=info.tsv
```
- My used `input.fastq`
```txt
@read-snp-start
TAAAAAAAAA
+
~~~~~~~~~~
@read-snp-end-
AAAAAAAAAT
+
~~~~~~~~~~
```
- The `info.tsv` output
```txt
read-snp-start -1 TAAAAAAAAA ~~~~~~~~~~
read-snp-end- 0 0 9 AAAAAAAAA T 1 ~~~~~~~~~ ~
```
- The output that you would have expected
```txt
read-snp-start 0 1 10 T AAAAAAAAA 1 ~ ~~~~~~~~~
read-snp-end- 0 0 9 AAAAAAAAA T 1 ~~~~~~~~~ ~
```
Maybe as a general background on what I try to achieve. I have reads structured like:
`{barcode_1}{const_region_1}{barcode_2}{const_region_2}{barcode_3}`
Each `barcode_N` can come from a set of ~100 different barcodes (i.e. the barcode a position `barcode_1` and `barcode_3` might be identical. My goal is to obtain a table indicating how often each barcode combinations was observed during the sequencing. My plan was to sequentially "trim" adapter sequences and using the `info.tsv` file to trace the barcode identifiers. Alternatively, I could maybe use the demultiplexing strategy, but this would lead to thousands of output files.
Any suggestions of how to handle this and insights to the adapter trimming as described above are welcomed.
Best,
Adriano
author:
url:https://github.com/adrianomartinelli
type:Person
name:adrianomartinelli
datePublished:2024-05-04T12:21:31.000Z
interactionStatistic:
type:InteractionCounter
interactionType:https://schema.org/CommentAction
userInteractionCount:5
url:https://github.com/784/cutadapt/issues/784
context:https://schema.org
headline:Unexpected trimming behavior
articleBody:Hey!
I am playing around with cutadapt to figure out if I can use it for our use case. I ran into the following issue:
My target sequence is `AAAAAAAAAA` and I ran cutadapt agains the sequences shown below that are of the same length but with a substitution either at the start or the end of the sequence. I would have expected that with an overlap of 9 and 0 error rate both reads in the `input.fastq` are identified. However, substitutions at the end are treated differently from substitutions at the start?
- Cutadapt and Python version: cutadapt 4.8 and Python 3.11.8
- How you installed the tool (conda or pip, for example): pip installed
- Which command-line parameters you used:
```bash
cutadapt -e 0 -O 9 \
-g AAAAAAAAAA \
-o /dev/null input.fastq \
--info-file=info.tsv
```
- My used `input.fastq`
```txt
@read-snp-start
TAAAAAAAAA
+
~~~~~~~~~~
@read-snp-end-
AAAAAAAAAT
+
~~~~~~~~~~
```
- The `info.tsv` output
```txt
read-snp-start -1 TAAAAAAAAA ~~~~~~~~~~
read-snp-end- 0 0 9 AAAAAAAAA T 1 ~~~~~~~~~ ~
```
- The output that you would have expected
```txt
read-snp-start 0 1 10 T AAAAAAAAA 1 ~ ~~~~~~~~~
read-snp-end- 0 0 9 AAAAAAAAA T 1 ~~~~~~~~~ ~
```
Maybe as a general background on what I try to achieve. I have reads structured like:
`{barcode_1}{const_region_1}{barcode_2}{const_region_2}{barcode_3}`
Each `barcode_N` can come from a set of ~100 different barcodes (i.e. the barcode a position `barcode_1` and `barcode_3` might be identical. My goal is to obtain a table indicating how often each barcode combinations was observed during the sequencing. My plan was to sequentially "trim" adapter sequences and using the `info.tsv` file to trace the barcode identifiers. Alternatively, I could maybe use the demultiplexing strategy, but this would lead to thousands of output files.
Any suggestions of how to handle this and insights to the adapter trimming as described above are welcomed.
Best,
Adriano
author:
url:https://github.com/adrianomartinelli
type:Person
name:adrianomartinelli
datePublished:2024-05-04T12:21:31.000Z
interactionStatistic:
type:InteractionCounter
interactionType:https://schema.org/CommentAction
userInteractionCount:5
url:https://github.com/784/cutadapt/issues/784
Person:
url:https://github.com/adrianomartinelli
name:adrianomartinelli
url:https://github.com/adrianomartinelli
name:adrianomartinelli
InteractionCounter:
interactionType:https://schema.org/CommentAction
userInteractionCount:5
interactionType:https://schema.org/CommentAction
userInteractionCount:5
External Links {π}(3)
Analytics and Tracking {π}
- Site Verification - Google
Libraries {π}
- Clipboard.js
- D3.js
- Lodash
Emails and Hosting {βοΈ}
Mail Servers:
- aspmx.l.google.com
- alt1.aspmx.l.google.com
- alt2.aspmx.l.google.com
- alt3.aspmx.l.google.com
- alt4.aspmx.l.google.com
Name Servers:
- dns1.p08.nsone.net
- dns2.p08.nsone.net
- dns3.p08.nsone.net
- dns4.p08.nsone.net
- ns-1283.awsdns-32.org
- ns-1707.awsdns-21.co.uk
- ns-421.awsdns-52.com
- ns-520.awsdns-01.net