From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wm1-x32e.google.com (mail-wm1-x32e.google.com [IPv6:2a00:1450:4864:20::32e]) by sourceware.org (Postfix) with ESMTPS id DA031383F844 for ; Tue, 28 Jul 2020 15:32:43 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org DA031383F844 Received: by mail-wm1-x32e.google.com with SMTP id k8so8307661wma.2 for ; Tue, 28 Jul 2020 08:32:43 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=nwkqht7f+JgU7MPaSD5Ao3ARSPnItQZRgmrzD+ODgF4=; b=qs0iD+54G08emffYH7v5Cpdz9H2IvWCDTJsrJJ8dcllMu2ksKPQVuXaf/bF1wCkZaB NMbNO6FmyNVd3t518s2s5qKgVWe+LvjXlYkITnWiuoPM40ZpBz0CXizHL0rSMM1cKjb7 eCjMDgqA9gBlTvnpigkMIXV+wcRYknZYRqGQXU18S+MZzw/laiZiFSofLa8VHlRzDgGX nkWRhoLkKX+t9OE4RHNZpXDrV6yM508OcBGuV+s5XG4VcKx5bTE8Z2Wn0GRSpxU0O2Tk Oc43LOcWXegbDSURlhqxmSXRj4CEYhST5L5phvfGwDzdNNtob2CML5us8Rph+L080wPE mSrA== X-Gm-Message-State: AOAM530N0GKDEb8RA13PGN6qOx7txW65/lP+CWnhoDeTy5CgTw3TCUFV EYhU3msxzSnu45vB5JVAPqahy2pQ1tr+hjaX47codMQ6rEA= X-Google-Smtp-Source: ABdhPJxkepp7IFAa9vs3t9AeuJfBpEj9Ulet1sQRGaoeXM6HERVobpLSccSlQQyEy/IGuUuBPwgoBVnmyC88h9SSLlQ= X-Received: by 2002:a1c:c345:: with SMTP id t66mr4491993wmf.0.1595950362643; Tue, 28 Jul 2020 08:32:42 -0700 (PDT) MIME-Version: 1.0 References: <1729e34a-245c-2e87-6604-bebf8e08d62b@cs.umass.edu> <000601d6643e$7153de00$53fb9a00$@rogers.com> <3609a7fa-6024-2091-f2fb-b3052ab8f051@SystematicSw.ab.ca> In-Reply-To: From: Bryan VanSchouwen Date: Tue, 28 Jul 2020 11:32:30 -0400 Message-ID: Subject: Re: Problem with output from gawk software in recent Cygwin installation To: cygwin@cygwin.com X-Spam-Status: No, score=-0.0 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, FREEMAIL_ENVFROM_END_DIGIT, FREEMAIL_FROM, HTML_MESSAGE, LOTS_OF_MONEY, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=no autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org Content-Type: text/plain; charset="UTF-8" X-Content-Filtered-By: Mailman/MimeDel 2.1.29 X-BeenThere: cygwin@cygwin.com X-Mailman-Version: 2.1.29 Precedence: list List-Id: General Cygwin discussions and problem reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Jul 2020 15:32:45 -0000 Just out of curiosity: Could this "" issue be something new for Windows 10? I ask because I don't recall having this issue with my old Windows 7 computer. Bryan On Tue, Jul 28, 2020 at 12:06 AM Brian Inglis < Brian.Inglis@systematicsw.ab.ca> wrote: > On 2020-07-27 15:58, Bryan VanSchouwen wrote: > > On Mon, Jul 27, 2020 at 4:20 PM Brian Inglis wrote: > >> On 2020-07-27 11:50, Michel LaBarre wrote: > >>> On July 27, 2020 12:52 PM, Eliot Moss wrote: > >>>> On 7/27/2020 11:47 AM, Bryan VanSchouwen wrote: > >>>>> I just tried executing an awk script using the most recent version > >>>>> of gawk, but the output did not turn out the way that it was supposed > >>>>> to. > >>>>> This script uses the following command to print the output data to > >>>>> the output file: > >>>>> print(cai[i], rpi[i], i) > > >>>>> "Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat" > >>>>> and previously, this command always printed the values of the three > >>>>> variables on a single line, separated by spaces; however, now the > >>>>> gawk software is automatically adding hard-returns between the > >>>>> values, resulting in the three values being printed on separate lines > >>>>> within the data file. > >>>>> What is going on here, and how do I permanently make it stop?? > > >>> Here's a wondering: Could it have to do with line endings? If Windows > >>> CRLF is getting in there, then the variables might get a CR in them, > >>> which might do weird things. This assumes those are string variables, > >>> not numeric. > > >> Better yet, how about an example using manifest constants in a one line > >> sample to eliminate impact of arrays or changes in input data as in: > gawk > >> 'BEGIN {print(1,2,3)}' or gawk 'BEGIN {print(1,2,3) > "xxx.txt"}'> > > No problem with awk or gawk: > > $ for ((i = 0; i < 10; ++i)) > > do > > printf "%d %d %d %d\n" $((i+1)) $((i+2)) $((i+3)) $((i+4)) > > done > test.txt > > $ awk '{print($1, $2, $3)}' test.txt > > 1 2 3 > > 2 3 4 > > 3 4 5 > > 4 5 6 > > 5 6 7 > > 6 7 8 > > 7 8 9 > > 8 9 10 > > 9 10 11 > > 10 11 12 > > So the issue appears to be with your command line, script, or input data > > file: please show the command line used to execute the script, attach > the > > complete awk script, and input data file for diagnosis, or selections of > the > > latter piped through or output using cat -A to show control characters. > > Here they are (attached). The script was executed with the following > > command:> gawk -f peak_intensity_correln_plot_compile.awk > Input files have \r\n line terminators and those > are > carried thru at the ends of the string fields: > > $ gawk -f peak_intensity_correln_plot_compile.awk > $ file *cPuMP*.dat > 2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF > line > terminators > cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF > line > terminators > Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text, with CR, LF > line > terminators > $ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head > 1571697^M 1716833^M 224$ > 2672863^M 2894992^M 225$ > 2184902^M 9710015^M 226$ > 4393362^M 4095908^M 227$ > 3828609^M 4218978^M 229$ > 6285045^M 4008320^M 233$ > 3936959^M 4104667^M 234$ > 1698322^M 1942553^M 237$ > 4144791^M 4346435^M 238$ > 2546328^M 2804338^M 239$ > > You could change your input line terminators to "\r\n" e.g. option > -vRS="\r\n", > insert '{ sub( /\r$/, ""); before each 'split(x, s, " ")', convert your > input > fields from strings to numbers by adding zero i.e. cai[i] += 0; rpi[i] += > 0; or > use belts, braces, and suspenders with all three, e.g. > > $ gawk -vRS="\r\n" -f peak_intensity_correln_plot_compile.awk > $ file *cPuMP*.dat > 2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF > line > terminators > cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF > line > terminators > Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text > $ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head > 1571697 1716833 224$ > 2672863 2894992 225$ > 2184902 9710015 226$ > 4393362 4095908 227$ > 3828609 4218978 229$ > 6285045 4008320 233$ > 3936959 4104667 234$ > 1698322 1942553 237$ > 4144791 4346435 238$ > 2546328 2804338 239$ > > -- > Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada > > This email may be disturbing to some readers as it contains > too much technical detail. Reader discretion is advised. > [Data in IEC units and prefixes, physical quantities in SI.] > -- > Problem reports: https://cygwin.com/problems.html > FAQ: https://cygwin.com/faq/ > Documentation: https://cygwin.com/docs.html > Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple >