From: Ernie Coskrey
Date: Fri, 29 Sep 2017 20:16:00 -0000
Subject: Internationalization issue - string processing
To: Cygwin

We have a Java program that launches Cygwin bash processes, which in turn run a script. The LC_ALL variable is set to "ja_JP". The script executes processes using Unicode strings that are specified like this: "\u3053" (the Hiragana letter Ko). For some reason, when bash calls another program and passes the string above to it, the string is converted to "0x3f 0x3f".

The script that is being run contains the following command:

    perl dump.pl "\u3053"

The perl script just prints out the hex values of its arguments, and it displays:

    ?? 3f 3f

The behavior is not reproducible if we run bash from a CMD prompt.

I know this is pretty open-ended, but are there any ideas as to what might be causing this sort of localization issue?

Ernie Coskrey
SIOS Technology Corp.
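
To narrow down where the conversion happens, it may help to dump the argument bytes from the shell itself, without perl in the picture. The following is only a minimal sketch and not the original dump.pl: dump_hex is a hypothetical helper name, and it assumes od is available in the Cygwin environment.

```shell
#!/bin/sh
# Hypothetical stand-in for dump.pl (not the original script):
# print the bytes of each argument as two-digit hex values, one line per argument.
dump_hex() {
    for arg in "$@"; do
        # od -An -v -tx1 prints every byte in hex without an address column.
        # The unquoted substitution lets the shell collapse od's padding so
        # that echo joins the values with single spaces.
        echo $(printf '%s' "$arg" | od -An -v -tx1)
    done
}

# Show the locale settings this shell actually inherited from its parent.
printf 'LC_ALL=%s LANG=%s\n' "${LC_ALL-}" "${LANG-}"

# Dump whatever arguments the script was started with.
dump_hex "$@"
```

For reference, the UTF-8 encoding of U+3053 is e3 81 93, and 3f is the ASCII code for "?". Output of 3f 3f from this helper would mean the argument already contained two question marks by the time the shell received it, which would point at the step before bash rather than at perl.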