public inbox for cygwin@cygwin.com
From: Corinna Vinschen <corinna-cygwin@cygwin.com>
To: cygwin@cygwin.com
Subject: Re: length in gawk returns wrong value
Date: Thu, 19 Jul 2012 09:21:00 -0000
Message-ID: <20120719092024.GA31055@calimero.vinschen.de>
In-Reply-To: <loom.20120719T103849-659@post.gmane.org>

On Jul 19 08:50, Ralf wrote:
> The following lines create a file named ttt.txt. The file ttt.txt contains
> exactly what I want (oct 374 for the umlaut u). But if you look at the output of
> these lines you can see that the function length() of gawk can not handle this
> character:
> 
> uname -a
> echo "Rücken" > ttt.txt
> od -c ttt.txt
> gawk '{print "Length: " length($0)}' ttt.txt
> 
> Output:
> CYGWIN_NT-6.0-WOW64 WIESWEG 1.7.9(0.237/5/3) 2011-03-29 10:10 i686 Cygwin

Uh oh.  1.7.9 is old.  Please update.

> 0000000   R 374   c   k   e   n  \r  \n
> 0000010
> Length: 1
> 
> What can I do to get the correct length in gawk without changing the contents of
> ttt.txt?

Dunno.  This is not what I see.  What did you have $LANG and $LC_CTYPE
set to?  Here's what I see:

  $ uname -a
  CYGWIN_NT-6.1 vmbert7 1.7.16(0.261/5/3) 2012-07-09 14:51 i686 Cygwin

  $ echo $LANG
  C.UTF-8

  $ echo "Rücken" > ttt.txt
  $ od -c ttt.txt
  0000000   R 303 274   c   k   e   n  \n
  0000010

  $ gawk '{print "Length: " length($0)}' ttt.txt
  Length: 6

  $ gawk --version | head -1
  GNU Awk 4.0.1
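
The two od dumps differ in how the umlaut is stored: a single byte 374
(Latin-1) in the quoted report, and the two bytes 303 274 (UTF-8) above.
As a rough sketch of why that matters for length(), the following feeds
both byte sequences to awk with LC_ALL=C, where length() should simply
count bytes. The helper name awk_length is made up for illustration, and
plain awk is used on the assumption that it behaves like gawk in the C
locale; substitute gawk as needed.

```shell
# Hypothetical helper: print length($0) for each stdin line under a given locale.
awk_length() {
    LC_ALL="$1" awk '{ print length($0) }'
}

# Latin-1 "Rücken": the umlaut is the single byte octal 374 (0xFC).
printf 'R\374cken\n' | awk_length C        # 6 bytes

# UTF-8 "Rücken": the umlaut is the two bytes octal 303 274 (0xC3 0xBC).
printf 'R\303\274cken\n' | awk_length C    # 7 bytes
```

In a UTF-8 locale the second input would be expected to count as 6
characters, matching the output above, while the lone 374 byte is not
valid UTF-8, so the result for the first input depends on the awk version.
Note also that the file in the quoted report ends in \r\n, so a byte
count there would include the carriage return.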


Corinna

-- 
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Project Co-Leader          cygwin AT cygwin DOT com
Red Hat

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
