emacs-bug-tracker
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[debbugs-tracker] bug#24858: closed (URGENT: Question about grep)


From: GNU bug Tracking System
Subject: [debbugs-tracker] bug#24858: closed (URGENT: Question about grep)
Date: Wed, 02 Nov 2016 15:50:02 +0000

Your message dated Wed, 2 Nov 2016 10:49:30 -0500
with message-id <address@hidden>
and subject line Re: bug#24858: URGENT: Question about grep
has caused the debbugs.gnu.org bug report #24858,
regarding URGENT: Question about grep
to be marked as done.

(If you believe you have received this mail in error, please contact
address@hidden)


-- 
24858: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=24858
GNU Bug Tracking System
Contact address@hidden with problems
--- Begin Message --- Subject: URGENT: Question about grep Date: Wed, 2 Nov 2016 15:53:45 +0100 User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.3.0

Dear grep developer,

I am Greta Romano and I need your help as soon as possibile.

I want to use grep command to search a string of 6 characters in every line of a file (biological file with DNA nucleotide).

The problem is that I need to search these 6 characters in the first 30 characters of each line. I report you an example:

String to search: GTGTCA

File:

>HWI-ST740:1:C2GCJACXX:1:1101:1279:1825 1:N:0:
NGACGCTCTGACCTTGGGGCTGGTCGGGGATGCTGAGGAGACGGTGACCAGGGTTCCCTGGCCCCACANNNCCAAGCTTCCNNNNNNNNNNNNNNNNNNN
>HWI-ST740:1:C2GCJACXX:1:1101:1349:1847 1:N:0:
NTTAGATGAGGGAAACATCTGCATCAAGTTGTTTATCTGTGACAACAAGTGTTGTTCCACTGCCAAAGAGTTTCTTATAATAAAACAATCGGGGTGGCACNNNNN

I want that the research is done only in the underline characters.  So what I have to add in grep command to put the limit of 30 characters?

Thank you very much

Best regards

Dr. Greta Romano


--- End Message ---
--- Begin Message --- Subject: Re: bug#24858: URGENT: Question about grep Date: Wed, 2 Nov 2016 10:49:30 -0500 User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.4.0
tag 24858 notabug
thanks

On 11/02/2016 09:53 AM, Greta wrote:

> String to search: GTGTCA
> 
> File:
> 
>>HWI-ST740:1:C2GCJACXX:1:1101:1279:1825 1:N:0:
> _/NGACGCTCTGACCTTGGGGCTGGTCGGGG/__A_TGCTGAGGAGACGGTGACCAGGGTTCCCTGGCCCCACANNNCCAAGCTTCCNNNNNNNNNNNNNNNNNNN
> 
>>HWI-ST740:1:C2GCJACXX:1:1101:1349:1847 1:N:0:
> _/NTTAGATGAGGGAAACATCTGCATCAAGTT/__G_TTTATCTGTGACAACAAGTGTTGTTCCACTGCCAAAGAGTTTCTTATAATAAAACAATCGGGGTGGCACNNNNN
> 
> 
> I want that the research is done only in the underline characters.

Underlining doesn't show up in plain text mail (and we prefer plain text
over html bloat on the mailing list).  But I think your point still made
it across

>  So
> what I have to add in grep command to put the limit of 30 characters?

You can't do it with grep.  But you can do it with sed or awk.  Use the
right tool for the job at hand :)

Let's strip your example down to a smaller test case: I want to search
for a one-byte string '1', but only in the first 3 bytes of a file.
With grep, it is not possible; the pattern matches anywhere in the line:

$ printf '012000001\n345000001\n' | grep 1
012000001
345000001

But with sed, we can copy the entire line to hold space, truncate the
line in pattern space, then do the search; if successful, print the line
stored in hold space:

$ printf '012000001\n345000001\n' | \
  sed -n 'h; s/^\(.\{3\}\).*/\1/; /1/ { x;p }'
012000001

And I'll leave the awk program as an exercise for the reader.

Therefore, I'm tagging this as not a bug.

-- 
Eric Blake   eblake redhat com    +1-919-301-3266
Libvirt virtualization library http://libvirt.org

Attachment: signature.asc
Description: OpenPGP digital signature


--- End Message ---

reply via email to

[Prev in Thread] Current Thread [Next in Thread]