public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug libstdc++/113504] New: High memory usage for parallel `std::sort`
@ 2024-01-19 14:17 ruben.laso at tuwien dot ac.at
  2024-01-19 15:41 ` [Bug libstdc++/113504] " redi at gcc dot gnu.org
  2024-01-22  8:18 ` ruben.laso at tuwien dot ac.at
  0 siblings, 2 replies; 3+ messages in thread
From: ruben.laso at tuwien dot ac.at @ 2024-01-19 14:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113504

            Bug ID: 113504
           Summary: High memory usage for parallel `std::sort`
           Product: gcc
           Version: 12.3.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: libstdc++
          Assignee: unassigned at gcc dot gnu.org
          Reporter: ruben.laso at tuwien dot ac.at
  Target Milestone: ---

The memory usage of parallel `std::sort` is very high compared to the
sequential version and even other parallel implementations.
The attached code is a simple test case to compare the memory usage of parallel
`std::sort`, `tbb::parallel_sort` and sequential `std::sort`.
The test case has been replicated in several systems with versions of GCC 10,
11 and 12.

An example of the results (and max. memory usage according to `/usr/bin/time`)
is shown in the following table:

| Executable         | Size        | Time     | Max Resident Memory |
| ------------------ | ----------- | -------- | ------------------- |
| ./pstl_sort.out    | 33554432    | 0:00.23  | 423776k             |
| ./tbb_sort.out     | 33554432    | 0:00.44  | 143952k             |
| ./seq_sort.out     | 33554432    | 0:03.32  | 134836k             |
| ./pstl_sort.out    | 1073741824  | 0:05.68  | 14236656k           |
| ./tbb_sort.out     | 1073741824  | 0:13.02  | 4207680k            |
| ./seq_sort.out     | 1073741824  | 2:07.38  | 4198124k            |

In the example, the parallel `std::sort` (pstl_sort) uses ~3 times more memory
than the `tbb::parallel_sort` (tbb_sort) and the sequential `std::sort`
(seq_sort).
It also runs faster, though.

System specs in the example:
CPU: AMD EPYC 7551
RAM: 256 GB DDR4
OS: Debian 10.10

Compilation with:
g++ -std=c++17 -O3 -pedantic -Wall -Wextra -Werror -o pstl_sort.out main.cpp
-ltbb -DPSTL_SORT
g++ -std=c++17 -O3 -pedantic -Wall -Wextra -Werror -o tbb_sort.out main.cpp
-ltbb -DTBB_SORT
g++ -std=c++17 -O3 -pedantic -Wall -Wextra -Werror -o seq_sort.out main.cpp
-ltbb

Did I miss something in the code?
Is that high memory usage a deliberate trade-off for performance?
Is the algorithm still in development to improve memory usage?

^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Bug libstdc++/113504] High memory usage for parallel `std::sort`
  2024-01-19 14:17 [Bug libstdc++/113504] New: High memory usage for parallel `std::sort` ruben.laso at tuwien dot ac.at
@ 2024-01-19 15:41 ` redi at gcc dot gnu.org
  2024-01-22  8:18 ` ruben.laso at tuwien dot ac.at
  1 sibling, 0 replies; 3+ messages in thread
From: redi at gcc dot gnu.org @ 2024-01-19 15:41 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113504

--- Comment #1 from Jonathan Wakely <redi at gcc dot gnu.org> ---
The parallel algos are taken from
https://github.com/llvm/llvm-project/tree/main/pstl so I would file an issue
upstream rather than here. The Intel PSTL developers are the right people to
ask.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Bug libstdc++/113504] High memory usage for parallel `std::sort`
  2024-01-19 14:17 [Bug libstdc++/113504] New: High memory usage for parallel `std::sort` ruben.laso at tuwien dot ac.at
  2024-01-19 15:41 ` [Bug libstdc++/113504] " redi at gcc dot gnu.org
@ 2024-01-22  8:18 ` ruben.laso at tuwien dot ac.at
  1 sibling, 0 replies; 3+ messages in thread
From: ruben.laso at tuwien dot ac.at @ 2024-01-22  8:18 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113504

--- Comment #2 from Ruben Laso <ruben.laso at tuwien dot ac.at> ---
(In reply to Jonathan Wakely from comment #1)
> The parallel algos are taken from
> https://github.com/llvm/llvm-project/tree/main/pstl so I would file an issue
> upstream rather than here. The Intel PSTL developers are the right people to
> ask.

I will ask there, then. Thank you!

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2024-01-22  8:18 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-01-19 14:17 [Bug libstdc++/113504] New: High memory usage for parallel `std::sort` ruben.laso at tuwien dot ac.at
2024-01-19 15:41 ` [Bug libstdc++/113504] " redi at gcc dot gnu.org
2024-01-22  8:18 ` ruben.laso at tuwien dot ac.at

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).