[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [bug-gawk] 4.7 Defining Fields by Content
From: |
Davide Brini |
Subject: |
Re: [bug-gawk] 4.7 Defining Fields by Content |
Date: |
Mon, 14 Mar 2016 22:58:26 +0100 |
On Mon, 14 Mar 2016 14:36:49 -0400, "Andrew J. Schorr"
<address@hidden> wrote:
> On Mon, Mar 14, 2016 at 05:28:36PM +0100, Marco Coletti wrote:
> > Of course linefeeds are still a problem.
> > RS cannot differentiate between linefeeds inside and outside of quotes.
> > A workaround could be to terminate records with "#\"\n" then set
> > RS="#\"\n", since this 3 characters sequence cannot possibly appear
> > inside fields. Of course this is not RFC 4180 anymore.
> >
> > Unfortunately my C/C++ skills are very rusted!
>
> OK. Eventually somebody will volunteer to write the C code to solve this
> problem properly.
Meanwhile there are awk CSV parsers here (that I know of, there may be
more):
http://lorance.freeshell.org/csv/
https://github.com/dualbus/awk/tree/master/csvawk
--
D.