From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 24483 invoked by alias); 12 Dec 2012 23:32:11 -0000 Received: (qmail 24470 invoked by uid 22791); 12 Dec 2012 23:32:10 -0000 X-SWARE-Spam-Status: No, hits=-3.4 required=5.0 tests=AWL,BAYES_00,KHOP_RCVD_UNTRUST,KHOP_THREADED,RCVD_IN_DNSWL_LOW,RCVD_IN_HOSTKARMA_YE,SARE_MILLIONSOF X-Spam-Check-By: sourceware.org Received: from mail-ie0-f173.google.com (HELO mail-ie0-f173.google.com) (209.85.223.173) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Wed, 12 Dec 2012 23:32:03 +0000 Received: by mail-ie0-f173.google.com with SMTP id e13so2980091iej.4 for ; Wed, 12 Dec 2012 15:32:03 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:cc:subject :references:in-reply-to:content-type:content-transfer-encoding :x-gm-message-state; bh=mOJapcZ5SjQNGG237DTHvkX05hoPVGCNsDdeMdLFRMI=; b=aDKJbOB17B4CwxjADNgqNDdmiwzF24//g06YKsztX2R+MC9MbQT8k7yN/t0TOuNwWQ W7jP+5Wy2cxy3WCuLKB6+Tb9b2VmdUOuGuG78yYkY3J837gigupktcB4t8f6L+Mlo9Ve 05oV4LVgM5XTWeErsdM0ykxwb+/V5frv8pxgFj4FnBWTcQ7WRLJrHAy6MA2F9oU6kwlO 2mt1h0Zgjv7DgZ7ejJfs0DOnsYc7jAYNpNzib1YNfUzySUGMBvvXKsIl/FmrFgKOfsBe TwXCNEZn/noefb4RmwCYr2T6ixhUUUJWxxe33SW01QAT3KD5C3nbzApaqAhVHQ2Z5BTU db6g== Received: by 10.50.53.168 with SMTP id c8mr15121813igp.10.1355355122971; Wed, 12 Dec 2012 15:32:02 -0800 (PST) Received: from MacBook-Pro-de-Maxime.local ([75.98.19.134]) by mx.google.com with ESMTPS id fv6sm2873603igc.17.2012.12.12.15.32.00 (version=SSLv3 cipher=OTHER); Wed, 12 Dec 2012 15:32:02 -0800 (PST) Message-ID: <50C913EF.2020501@calculquebec.ca> Date: Wed, 12 Dec 2012 23:32:00 -0000 From: Maxime Boissonneault User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:17.0) Gecko/17.0 Thunderbird/17.0 MIME-Version: 1.0 To: Rhys Ulerich CC: gsl-discuss@sourceware.org Subject: Re: Adding OpenMP support for some of the GSL functions References: <50C791BB.4060303@calculquebec.ca> <50C8F2E7.7070701@calculquebec.ca> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit X-Gm-Message-State: ALoCoQnxzS7vasFzXOMavgPGcbocJtYLTFczpsYf+x9yVZ2OvuIRvjlgiRG69fSfz4H0iKxcQwq3 Mailing-List: contact gsl-discuss-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: gsl-discuss-owner@sourceware.org X-SW-Source: 2012-q4/txt/msg00007.txt.bz2 Hi Rhys, I will have a deeper look at vectorization of GSL, but in my understanding, vectorizing can only be done with simple operations, while algorithms like RKF45 involve about 10 operations per loop iterations. You are correct, I meant one problem with a million degrees of freedom. Maxime Le 2012-12-12 16:41, Rhys Ulerich a écrit : >> The more intensive function is within rkf45_apply in my case. I simply added >> a few pragmas to the loops, and it speed it up quite a lot. > Having looked at where you've placed the #pragma omp parallels, have > you tried enabling vectorization to see if the time spent in those > axpy-like operations could be improved? A good SSE-ready optimizer > should nail those. > > I may have misunderstood your "millions of differential equations" > statement. Are you rather solving one problem with a million degrees > of freedom? > > - Rhys