From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp-out-no.shaw.ca (smtp-out-no.shaw.ca [64.59.134.13]) by sourceware.org (Postfix) with ESMTPS id 0ECD53857C49 for ; Tue, 28 Jul 2020 04:05:18 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 0ECD53857C49 Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=SystematicSw.ab.ca Authentication-Results: sourceware.org; spf=none smtp.mailfrom=brian.inglis@systematicsw.ab.ca Received: from [192.168.1.104] ([24.64.172.44]) by shaw.ca with ESMTP id 0Grck7E75ng7K0GrdkWeD9; Mon, 27 Jul 2020 22:05:17 -0600 X-Authority-Analysis: v=2.3 cv=ecemg4MH c=1 sm=1 tr=0 a=kiZT5GMN3KAWqtYcXc+/4Q==:117 a=kiZT5GMN3KAWqtYcXc+/4Q==:17 a=IkcTkHD0fZMA:10 a=RZ24vCjvlsqmbDxLIRQA:9 a=7Zwj6sZBwVKJAoWSPKxL6X1jA+E=:19 a=QEXdDO2ut3YA:10 Reply-To: cygwin@cygwin.com Subject: Re: Problem with output from gawk software in recent Cygwin installation To: cygwin@cygwin.com References: <1729e34a-245c-2e87-6604-bebf8e08d62b@cs.umass.edu> <000601d6643e$7153de00$53fb9a00$@rogers.com> <3609a7fa-6024-2091-f2fb-b3052ab8f051@SystematicSw.ab.ca> From: Brian Inglis Autocrypt: addr=Brian.Inglis@SystematicSw.ab.ca; prefer-encrypt=mutual; keydata= mDMEXopx8xYJKwYBBAHaRw8BAQdAnCK0qv/xwUCCZQoA9BHRYpstERrspfT0NkUWQVuoePa0 LkJyaWFuIEluZ2xpcyA8QnJpYW4uSW5nbGlzQFN5c3RlbWF0aWNTdy5hYi5jYT6IlgQTFggA PhYhBMM5/lbU970GBS2bZB62lxu92I8YBQJeinHzAhsDBQkJZgGABQsJCAcCBhUKCQgLAgQW AgMBAh4BAheAAAoJEB62lxu92I8Y0ioBAI8xrggNxziAVmr+Xm6nnyjoujMqWcq3oEhlYGAO WacZAQDFtdDx2koSVSoOmfaOyRTbIWSf9/Cjai29060fsmdsDLg4BF6KcfMSCisGAQQBl1UB BQEBB0Awv8kHI2PaEgViDqzbnoe8B9KMHoBZLS92HdC7ZPh8HQMBCAeIfgQYFggAJhYhBMM5 /lbU970GBS2bZB62lxu92I8YBQJeinHzAhsMBQkJZgGAAAoJEB62lxu92I8YZwUBAJw/74rF IyaSsGI7ewCdCy88Lce/kdwX7zGwid+f8NZ3AQC/ezTFFi5obXnyMxZJN464nPXiggtT9gN5 RSyTY8X+AQ== Organization: Systematic Software Message-ID: Date: Mon, 27 Jul 2020 22:05:16 -0600 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-CA Content-Transfer-Encoding: 7bit X-CMAE-Envelope: MS4wfLoo+QJbZM4IKFFV6XOPAJUfDbR4nHZN0GzVhyrX063bp0t2hmdiXe2MvWEI0xvxjGRGWL61jf8w3aeqX+/XCW3QvkhCZ316r8aM5W2k4S6OStMvzQwN DNcuWNlWYYNRPISWL/cKUhooe1KSWB0d+IrPDSkP4otfsHU+PMBNp0SJ75w0/qNTqXLG8cc4Ct5Dkw== X-Spam-Status: No, score=-1.0 required=5.0 tests=BAYES_00, KAM_DMARC_STATUS, KAM_LAZY_DOMAIN_SECURITY, LOTS_OF_MONEY, RCVD_IN_DNSWL_LOW, SPF_HELO_NONE, SPF_NONE, TXREP autolearn=no autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: cygwin@cygwin.com X-Mailman-Version: 2.1.29 Precedence: list List-Id: General Cygwin discussions and problem reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Jul 2020 04:05:20 -0000 On 2020-07-27 15:58, Bryan VanSchouwen wrote: > On Mon, Jul 27, 2020 at 4:20 PM Brian Inglis wrote: >> On 2020-07-27 11:50, Michel LaBarre wrote: >>> On July 27, 2020 12:52 PM, Eliot Moss wrote: >>>> On 7/27/2020 11:47 AM, Bryan VanSchouwen wrote: >>>>> I just tried executing an awk script using the most recent version >>>>> of gawk, but the output did not turn out the way that it was supposed >>>>> to. >>>>> This script uses the following command to print the output data to >>>>> the output file: >>>>> print(cai[i], rpi[i], i) > >>>>> "Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat" >>>>> and previously, this command always printed the values of the three >>>>> variables on a single line, separated by spaces; however, now the >>>>> gawk software is automatically adding hard-returns between the >>>>> values, resulting in the three values being printed on separate lines >>>>> within the data file. >>>>> What is going on here, and how do I permanently make it stop?? >>> Here's a wondering: Could it have to do with line endings? If Windows >>> CRLF is getting in there, then the variables might get a CR in them, >>> which might do weird things. This assumes those are string variables, >>> not numeric. >> Better yet, how about an example using manifest constants in a one line >> sample to eliminate impact of arrays or changes in input data as in: gawk >> 'BEGIN {print(1,2,3)}' or gawk 'BEGIN {print(1,2,3) > "xxx.txt"}'> > No problem with awk or gawk: > $ for ((i = 0; i < 10; ++i)) > do > printf "%d %d %d %d\n" $((i+1)) $((i+2)) $((i+3)) $((i+4)) > done > test.txt > $ awk '{print($1, $2, $3)}' test.txt > 1 2 3 > 2 3 4 > 3 4 5 > 4 5 6 > 5 6 7 > 6 7 8 > 7 8 9 > 8 9 10 > 9 10 11 > 10 11 12 > So the issue appears to be with your command line, script, or input data > file: please show the command line used to execute the script, attach the > complete awk script, and input data file for diagnosis, or selections of the > latter piped through or output using cat -A to show control characters. > Here they are (attached). The script was executed with the following > command:> gawk -f peak_intensity_correln_plot_compile.awk Input files have \r\n line terminators and those are carried thru at the ends of the string fields: $ gawk -f peak_intensity_correln_plot_compile.awk $ file *cPuMP*.dat 2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF line terminators cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF line terminators Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text, with CR, LF line terminators $ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head 1571697^M 1716833^M 224$ 2672863^M 2894992^M 225$ 2184902^M 9710015^M 226$ 4393362^M 4095908^M 227$ 3828609^M 4218978^M 229$ 6285045^M 4008320^M 233$ 3936959^M 4104667^M 234$ 1698322^M 1942553^M 237$ 4144791^M 4346435^M 238$ 2546328^M 2804338^M 239$ You could change your input line terminators to "\r\n" e.g. option -vRS="\r\n", insert '{ sub( /\r$/, ""); before each 'split(x, s, " ")', convert your input fields from strings to numbers by adding zero i.e. cai[i] += 0; rpi[i] += 0; or use belts, braces, and suspenders with all three, e.g. $ gawk -vRS="\r\n" -f peak_intensity_correln_plot_compile.awk $ file *cPuMP*.dat 2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF line terminators cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF line terminators Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text $ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head 1571697 1716833 224$ 2672863 2894992 225$ 2184902 9710015 226$ 4393362 4095908 227$ 3828609 4218978 229$ 6285045 4008320 233$ 3936959 4104667 234$ 1698322 1942553 237$ 4144791 4346435 238$ 2546328 2804338 239$ -- Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada This email may be disturbing to some readers as it contains too much technical detail. Reader discretion is advised. [Data in IEC units and prefixes, physical quantities in SI.]