public inbox for gcc@gcc.gnu.org
 help / color / mirror / Atom feed
From: Arthur Cohen <arthur.cohen@embecosm.com>
To: connor horman <chorman64@gmail.com>, gcc@gcc.gnu.org
Subject: Re: [Request for Comments] Using Rust libraries in the Rust frontend
Date: Fri, 26 Jan 2024 10:30:01 +0100	[thread overview]
Message-ID: <677c3005-9f4f-4f3b-9eb7-4233f9f9b7e1@embecosm.com> (raw)
In-Reply-To: <CADLV35g=ruH-j1hDBdE=SmPamv9GKZ1grRcF52S-0apUK8r-nQ@mail.gmail.com>

Hi Connor,

On 1/25/24 16:55, connor horman wrote:
> Note that it may not make sense to include a source copy of rustc, as 
> that will itself require an (earlier) stage of rustc to build. mrustc 
> can offer a bootstrap for 1.54, but depending on the versions required, 
> you may need upwards of 10 additional rustc sources.

Yes, I was a bit worried about the idea but it seems this is just me not 
knowing how to read so all is good. Using mrustc for this is a great 
idea however.

Thanks!

Arthur

> 
> On Thu, 25 Jan 2024 at 10:04, Arthur Cohen <arthur.cohen@embecosm.com 
> <mailto:arthur.cohen@embecosm.com>> wrote:
> 
>     Hi Richard,
> 
>     On 1/23/24 08:23, Richard Biener wrote:
>      > On Mon, Jan 22, 2024 at 7:51 PM Arthur Cohen
>     <arthur.cohen@embecosm.com <mailto:arthur.cohen@embecosm.com>> wrote:
>      >>
>      >> Hi everyone,
>      >>
>      >> In order to increase the development speed of Rust features, we are
>      >> seeking feedback on reusing some Rust components directly within our
>      >> front-end. As mentioned in other conferences, the most important
>      >> component we would like to integrate into the front-end is the
>     polonius
>      >> borrow-checker. Another component we've become interested in is the
>      >> `rustc_format_parser` library, responsible for handling parsing and
>      >> handling of format arguments as described in the documentation for
>      >> `std::fmt` [1].
>      >>
>      >> However, since these libraries are written in Rust, GCC in
>     itself is not
>      >> yet able to compile them. They all depend on the Rust standard
>     library,
>      >> which we cannot yet compile and link properly. This obviously
>     raises a
>      >> question - how to actually compile, integrate and distribute these
>      >> components?
>      >>
>      >> We do have the option to rewrite them from scratch, but we feel that
>      >> spending time on these existing, correct, battle-tested and easily
>      >> integrable components instead of focusing on other aspects of the
>      >> language would be a mistake. Spending this time instead on Rust
>     features
>      >> that we are missing for compiling these components would, in our
>      >> opinion, yield more results, as it would also help in compiling
>     other
>      >> Rust programs.
>      >>
>      >> We could either distribute these components as compiled
>     libraries, or
>      >> look at integrating the official Rust compiler to our build
>     system as a
>      >> temporary measure. I am aware that this would mean restricting
>     the Rust
>      >> GCC front-end to platforms where the official Rust compiler is also
>      >> available, which is less than ideal.
>      >
>      > But that's only for the host part - you can still cross compile
>     to another
>      > target and possibly, once the GCC frontend can compile these
>     libraries
>      > itself, also use them to bootstrap a hosted version on that target -
>      > speaking of ..
>      >
>      >> However, this would only be
>      >> temporary - as soon as the Rust front-end is able to compile these
>      >> components, we would simply reuse them and compile them with
>     gccrs as
>      >> part of our bootstrapping process.
>      >
>      > .. gccrs would then need to be able to build itself without those
>     modules,
>      > at least during stage1 so that the stage1 compiler can then be
>     used to
>      > build them.  Or you go like m2 and build a "mini-rust" that's
>     just capable
>      > of building the modules.
> 
>     Right, that makes a lot of sense. We should definitely be able to build
>     the format string parser without a format string parser, as it does not
>     use format strings for error handling or anything. And even if it did,
>     it would be pretty easy to remove that and do the formatting by hand.
> 
>     Similarly, the borrow checker is not "needed" for compilation and we do
>     plan on building stage1 without it, while making it mandatory for
>     stage2/3 builds.
> 
>      > I think re-using parts already available is very sensible at this
>     point.  Note
>      > that while we might temporarily require a host rust compiler to
>     boostrap
>      > gccrs I'd rather not have the build system download something
>     from the
>      > internet - so at least the sources of those dependences need to
>     be in the
>      > GCC repository, possibly in a new toplevel directory.
> 
>     Okay, that makes a lot of sense. I was thinking of adding a basic check
>     for the Rust compiler to be present when compiling these components -
>     and error out if that isn't the case. Are you suggesting we embed a
>     full
>     copy of rustc in GCC and build it from source when compiling the Rust
>     frontend? Or am I misunderstanding?
> 
>      >> The documentation for `std::fmt` [1] describes all of the features
>      >> available in Rust format strings. It also contains a grammar for the
>      >> format-string parser, which we would need to re-implement on top of
>      >> supporting all the formatting features. As a prototype, I wrote
>     an FFI
>      >> interface to the `rustc_format_parser`  library and integrated
>     it to our
>      >> macro expansion system, which took me less than a couple hours.
>     In less
>      >> than an afternoon, we had bindings for all of the exported types and
>      >> functions in the library and had access to a compliant and
>     performant
>      >> Rust format string parser. But re-implementing a correct
>      >> `format_args!()` parser - with the same performance as the Rust
>     one, and
>      >> the same amount of features - would probably take days, if not
>     weeks.
>      >> And we are talking about one of the simplest components we aim
>     to reuse.
>      >> Something like a fully-compliant trait solver for the Rust
>     programming
>      >> language would take months if not years to implement properly
>     from scratch.
>      >>
>      >> I would like to stress once again that relying on distributing
>     compiled
>      >> libraries or using `rustc` in our build system would be
>     temporary, and
>      >> that these measures would be removed as soon as gccrs is able to
>     compile
>      >> these components from source.
>      >>
>      >> I am looking for comments on this decision as well as the best
>     practices
>      >> we could adopt. Have there been any similar decisions for other
>      >> self-hosted front-ends? Any previous email threads/commits that you
>      >> could point me to would be greatly appreciated.
>      >
>      > Not in this very same way but the D frontend is a shim around the
>     official
>      > DMD frontend and the Go frontend imports the golang standard library
>      > (but the frontend itself is written in C++ and doesn't use any
>     part of it).
>      >
>      > The C++ frontend uses part of the C++ standard library (but it can of
>      > course build that itself - but it requires a host C++ compiler
>     with library).
> 
>     Thanks for the pointers. I was wondering if this is something that the
>     Ada frontend had faced at the beginning, but I've been told it does not
>     have a lot of dependencies anyway so this might not be helpful.
> 
>      >
>      > Richard.
>      >
>      >> Thanks,
>      >>
>      >> Arthur
>      >>
>      >> [1]: https://doc.rust-lang.org/std/fmt/
>     <https://doc.rust-lang.org/std/fmt/>
> 
>     Thanks a lot for taking the time, I really appreciate it.
> 
>     Best,
> 
>     Arthur
> 

  reply	other threads:[~2024-01-26  9:30 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-01-22 17:30 Arthur Cohen
2024-01-23  7:23 ` Richard Biener
2024-01-25 15:03   ` Arthur Cohen
2024-01-25 15:55     ` connor horman
2024-01-26  9:30       ` Arthur Cohen [this message]
2024-01-25 16:26     ` Iain Sandoe
2024-01-26  9:28       ` Arthur Cohen
2024-01-26 21:39       ` NightStrike
2024-01-26 23:25         ` David Starner
2024-01-25 18:19     ` Richard Biener
2024-01-26  9:24       ` Arthur Cohen
2024-01-26 21:41 ` NightStrike

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=677c3005-9f4f-4f3b-9eb7-4233f9f9b7e1@embecosm.com \
    --to=arthur.cohen@embecosm.com \
    --cc=chorman64@gmail.com \
    --cc=gcc@gcc.gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).