From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 7186 invoked by alias); 18 Feb 2019 11:48:04 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 6427 invoked by uid 89); 18 Feb 2019 11:48:03 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=ham version=3.3.2 spammy=Unless X-HELO: atfriesa01.ssi-schaefer.com Received: from atfriesa01.ssi-schaefer.com (HELO atfriesa01.ssi-schaefer.com) (193.186.16.100) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Mon, 18 Feb 2019 11:48:02 +0000 Received: from samail03.wamas.com (HELO mailhost.salomon.at) ([172.28.33.235]) by atfriesa01.ssi-schaefer.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 18 Feb 2019 12:48:00 +0100 Received: from [172.28.42.244] by mailhost.salomon.at with esmtp (Exim 4.77) (envelope-from ) id 1gvhOx-0003rW-8A; Mon, 18 Feb 2019 12:47:59 +0100 Subject: Re: textmode for stdout, what is "correct" now? References: <20190215102251.GA2702@calimero.vinschen.de> <20190215124844.GE2702@calimero.vinschen.de> <6d02258d-115d-135c-1404-1b02eec34045@ssi-schaefer.com> <20190215203108.GN2702@calimero.vinschen.de> <20190216093855.GR2702@calimero.vinschen.de> <863f060b-9c2f-1c78-30e8-c1486d567f74@ssi-schaefer.com> <20190216174313.GG4256@calimero.vinschen.de> <380fcd6e-cbe1-ebe4-c13f-a8d911c148ac@ssi-schaefer.com> <20190218102650.GW4256@calimero.vinschen.de> From: Michael Haubenwallner To: cygwin@cygwin.com Openpgp: preference=signencrypt Message-ID: <696f5a12-ad45-c3d9-715b-bd68b3f8d14c@ssi-schaefer.com> Date: Mon, 18 Feb 2019 12:15:00 -0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.5.1 MIME-Version: 1.0 In-Reply-To: <20190218102650.GW4256@calimero.vinschen.de> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-SW-Source: 2019-02/txt/msg00281.txt.bz2 On 2/18/19 11:26 AM, Corinna Vinschen wrote: > On Feb 18 10:40, Michael Haubenwallner wrote: >> On 2/16/19 6:43 PM, Corinna Vinschen wrote: >>> I really miss the problem you're trying to solve here. Why should an >>> application setting O_BINARY explicitely revert this decision on the >>> same file descriptor? That doesn't make sense. >> >> Well, it's not necessarily about really switching binary mode on and off, >> it's more about avoiding breakage when applications try to intuitively >> follow the original API, even if that actually causes the call to >> setmode(fd, O_TEXT) to be redundant. >> >> OTOH, this question also would apply to native Win32 applications, so why >> do people call setmode(fd, O_TEXT) with any DOS based platform at all? >> >> IMO, unfortunately we're not in a position to modify the intention of the >> original API. And finally, I do want to stop discussions like this one >> with application developers like openssl, as soon as we can argue like: >> "Cygwin does not use \r internally, but does support text mode mounts, >> so we had to invent the Cygwin text mode, which may or may not use \r. >> Hence you get the Cygwin text mode with O_TEXT, and if you really are >> in some 'unix2dos' position, please use the new O_DOSTEXT mode instead." >> >> However, agreed this does not seem to be trivial to implement. Yet I >> will look into it when there is a chance for a patches to be accepted. > > Bottom line: > > - Make O_TEXT equivalent to O_BINARY on the API level so Cygwin > actually uses binary mode on open(O_TEXT) and setmode(O_TEXT). No, O_TEXT is neither equal to O_BINARY nor to O_DOSTEXT - it's something in between. My first ideas are either (O_BINARY|O_DOSTEXT) or another bit. > - Make O_DOSTEXT equivalent to the former O_TEXT. Yes. > Result: we use binary mode even with tools explicitely specifying O_TEXT. No, not binary mode. It's text mode with \r being allowed rather than forced. > - How do you handle fopen modes "wt", "rt", etc, then? This API feels a little different, because it does allow for with "b" and without "b", as well as with "t" and without "t", so "t" should translate to O_DOSTEXT IMO. While talking about it: What will setmode(fd, 0) do? > - What about automode.c, binmode.c, textmode.c, textreadmode.c? Just stumbled over the distinction between readmode and writemode: What's up with that? Unless binary mode, reading always could be done in dostext mode. Here the default is to link without them, and the opposite of binmode.c is to not use anything, hence the text*mode should be O_DOSTEXT. > - How do you avoid breakage of existing tools which have been written to > work explicitely with certain DOS formatted text file and use O_TEXT > for that? > > The answer to the last one could be using a new version check like the > ones already in include/cygwin/version.h. Existing tools and libs keep > the current behaviour. Only newly built binaries get the new behaviour. Exactly. And for the check: For dostext mode: ifdef O_DOSTEXT: use O_DOSTEXT, otherways use O_TEXT. For cygtext mode: ifdef O_DOSTEXT: use O_TEXT, otherways avoid setmode. > However, this still may result in breakage if the developer isn't aware > of this subtil change. As much as I hate O_TEXT mode, there's a > pretty basic expectation how this is supposed to work. Yes, but I do expect this in corner cases only, with unix2dos/dos2unix as the specific example. OTOH, with setmode(fd, 0) coming to my mind: If that would denote the default (=cygwin text) mode, I can imagine we may convince (openssl) developers to use zero instead of O_TEXT, and everything could be fine without any Cygwin change. Heck, this would feel like most obvious - even API wise, no? Then we may want to add O_NOBINARY defined to zero as the only Cygwin change. Thanks! /haubi/ -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple