* GSL K0/K1 [not found] <56F42FED.8060500@colorado.edu> @ 2016-03-24 18:22 ` Patrick Alken 2016-03-25 20:03 ` Gerard Jungman 2016-03-25 21:24 ` error estimates (was: GSL K0/K1) Gerard Jungman 0 siblings, 2 replies; 7+ messages in thread From: Patrick Alken @ 2016-03-24 18:22 UTC (permalink / raw) To: gsl-discuss Pavel, The Bessel function results look good. If everything is tested you can merge it all into the master branch. I'm cc'ing gsl-discuss so that this discussion will start to be archived. Regarding the error estimates, an attempt was made early on in GSL to provide error estimates for all the special functions. From section 7.2 of the manual: ---- The field val contains the value and the field err contains an estimate of the absolute error in the value. ---- I confess I don't fully understand all the error estimation, or even how accurate the estimates are. I don't know if any studies were done to verify the accuracy of these error estimates. One of the original GSL developers (Gerard Jungman) wrote a lot of this code (I've cc'd him on this email) - maybe he can help shed light on this. Since you've already done exhaustive comparisons of your new Bessel functions against arbitrary precision libraries, maybe you could simply change the error estimate to be: err = factor * GSL_DBL_EPSILON where factor are the numbers you've found in your error analysis. This way the calculation would be very fast (and probably more accurate than the current error estimation). We can wait and see if Gerard has any suggestions on this too. Gerard - since you haven't been following this email chain, Pavel has found more accurate Chebyshev expansions for the Bessel K0/K1 functions which have lower errors than the original GSL implementations when compared with arbitrary precision libraries. Through lots of random testing he has found upper limits on the errors of each function given by some positive constant times GSL_DBL_EPSILON. We are wondering if its safe to change the error estimation of these functions (I don't know the origin of the current calculations for these error estimates). Patrick ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: GSL K0/K1 2016-03-24 18:22 ` GSL K0/K1 Patrick Alken @ 2016-03-25 20:03 ` Gerard Jungman 2016-03-25 21:18 ` Patrick Alken 2016-03-25 22:20 ` Brian Gladman 2016-03-25 21:24 ` error estimates (was: GSL K0/K1) Gerard Jungman 1 sibling, 2 replies; 7+ messages in thread From: Gerard Jungman @ 2016-03-25 20:03 UTC (permalink / raw) To: gsl-discuss As you say, this is the first I have seen of this. The results look very good. I briefly compared the code to the old version. It looks like you changed the domains for some of the expansions as well as the coefficients and some of the fractional transformations. So that's a lot of differences to track. Just glancing at the differences, I cannot tell why it is so much better. Do you know exactly what happened? The main problem seemed to be for x in (1,2). The big break in the 1/x plots at x=2 suggests that there was a problem in the coefficients in that domain. Chebyshev expansions should not behave that way; they should provide essentially uniform error over the domain. Was it some kind of typo in the tables? That's what it looks like to me. But maybe not... in fact I don't see how that would have happened. Maybe it was something more subtle. The basic method, of course, goes back to SLATEC, so I suppose that version has the same problems...? We can and should have a discussion about error estimates in general. But I will create another thread for that, so as not to expand or hijack this topic. As for the method and error estimates for K0 and K1, I am trying to understand myself what is happening there. Most of it is straightforward; the only problem is the domain x <= 2.0. I wrote this code about 20 years ago and haven't looked at it since. The assumption was that the original SLATEC was basically best possible, so it did not get a lot of scrutiny. I looked at the SLATEC source. In the code I refer to "bk0.f", although it seems to actually be called "besk0.f", so maybe there have been further changes to SLATEC since then? Anyway, SLATEC uses this method where it evaluates I0 for some reason. So I do the same. I have no idea why they do this. Maybe there is an original paper somewhere that explains it. Clearly it is not necessary, and it's much cleaner to avoid this dependency as well. If the expansion was correct in the old code, then the error must have been creeping in through the evaluation of I0. Has anybody checked I0? If it is also messed up, then maybe we have found the culprit. The error estimate in that section is just an application of some derivative estimates for that particular expression. But why that expression is used is entirely a mystery to me. A brief comment on the random testing: This is a good way to estimate the coefficient in the uniform error bound. But in principle it should not be necessary. There must be rigorous bounds for these expansions based on the uniformity of the Chebyshev expansions. Can the necessary coefficients be evaluated directly as part of the computation of the expansions? That would be best of all. These error bounds could be documented somewhere for posterity, at the least. -- G. Jungman On 03/24/2016 12:22 PM, Patrick Alken wrote: > Pavel, > > The Bessel function results look good. If everything is tested you can > merge it all into the master branch. I'm cc'ing gsl-discuss so that this > discussion will start to be archived. > > Regarding the error estimates, an attempt was made early on in GSL to > provide error estimates for all the special functions. From section 7.2 > of the manual: > > ---- > The field val contains the value and the field err contains an estimate > of the absolute error in the value. > ---- > > I confess I don't fully understand all the error estimation, or even how > accurate the estimates are. I don't know if any studies were done to > verify the accuracy of these error estimates. One of the original GSL > developers (Gerard Jungman) wrote a lot of this code (I've cc'd him on > this email) - maybe he can help shed light on this. > > Since you've already done exhaustive comparisons of your new Bessel > functions against arbitrary precision libraries, maybe you could simply > change the error estimate to be: > > err = factor * GSL_DBL_EPSILON > > where factor are the numbers you've found in your error analysis. This > way the calculation would be very fast (and probably more accurate than > the current error estimation). > > We can wait and see if Gerard has any suggestions on this too. > > Gerard - since you haven't been following this email chain, Pavel has > found more accurate Chebyshev expansions for the Bessel K0/K1 functions > which have lower errors than the original GSL implementations when > compared with arbitrary precision libraries. Through lots of random > testing he has found upper limits on the errors of each function given > by some positive constant times GSL_DBL_EPSILON. We are wondering if its > safe to change the error estimation of these functions (I don't know the > origin of the current calculations for these error estimates). > > Patrick ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: GSL K0/K1 2016-03-25 20:03 ` Gerard Jungman @ 2016-03-25 21:18 ` Patrick Alken 2016-03-26 5:43 ` Pavel Holoborodko 2016-03-25 22:20 ` Brian Gladman 1 sibling, 1 reply; 7+ messages in thread From: Patrick Alken @ 2016-03-25 21:18 UTC (permalink / raw) To: gsl-discuss, Pavel Holoborodko Gerard, Yes sorry for including you so late in the discussion! Pavel (cc'd) found that some of the GSL Chebyshev coefficients copied over from SLATEC were rounded incorrectly (ie: just the very last digit or two was off which caused quite a large error near x=2 for K0 and K1). The original GSL / SLATEC implementations broke up the K0/K1 interval in 3 pieces: [0,2] [2,8] [8,inf] Pavel developed new approximations using the intervals: [0,1] (new rational approximation from Pavel) [1,8] (new Chebyshev from Pavel) [8,inf] (original GSL Chebyshev) which avoids the troublesome point x=2 and has an overall lower error when compared with arbitrary precision libraries over many random points. In principle we could simply correct the SLATEC rounding errors, but Pavel's new method has even lower error than the corrected Chebyshev series. During some exhaustive numerical testing with random values, Pavel found fairly good upper limits to the error on the K0/K1 on each of the 3 intervals, which is why we think we should change the error estimate to the empirical values found through the numerical simulations. Pavel found similar problems in I0/I1 which he is now working on correcting. Patrick On 03/25/2016 02:03 PM, Gerard Jungman wrote: > > As you say, this is the first I have seen of this. > The results look very good. > > I briefly compared the code to the old version. > It looks like you changed the domains for some > of the expansions as well as the coefficients > and some of the fractional transformations. > So that's a lot of differences to track. > > Just glancing at the differences, I cannot tell > why it is so much better. Do you know exactly > what happened? > > The main problem seemed to be for x in (1,2). > The big break in the 1/x plots at x=2 suggests > that there was a problem in the coefficients > in that domain. Chebyshev expansions should > not behave that way; they should provide > essentially uniform error over the domain. > > Was it some kind of typo in the tables? > That's what it looks like to me. But > maybe not... in fact I don't see how > that would have happened. > > Maybe it was something more subtle. The basic method, > of course, goes back to SLATEC, so I suppose that > version has the same problems...? > > > We can and should have a discussion about error estimates > in general. But I will create another thread for that, so > as not to expand or hijack this topic. > > > As for the method and error estimates for K0 and K1, I am > trying to understand myself what is happening there. Most > of it is straightforward; the only problem is the domain > x <= 2.0. I wrote this code about 20 years ago and > haven't looked at it since. The assumption was that > the original SLATEC was basically best possible, so > it did not get a lot of scrutiny. > > I looked at the SLATEC source. In the code I refer to "bk0.f", > although it seems to actually be called "besk0.f", so maybe > there have been further changes to SLATEC since then? > > Anyway, SLATEC uses this method where it evaluates I0 for > some reason. So I do the same. I have no idea why they do > this. Maybe there is an original paper somewhere that > explains it. Clearly it is not necessary, and it's > much cleaner to avoid this dependency as well. > If the expansion was correct in the old code, > then the error must have been creeping in > through the evaluation of I0. > > Has anybody checked I0? If it is also messed up, > then maybe we have found the culprit. > > The error estimate in that section is just an application > of some derivative estimates for that particular expression. > But why that expression is used is entirely a mystery to me. > > > A brief comment on the random testing: This is a good way > to estimate the coefficient in the uniform error bound. > But in principle it should not be necessary. There must > be rigorous bounds for these expansions based on the > uniformity of the Chebyshev expansions. Can the > necessary coefficients be evaluated directly as > part of the computation of the expansions? That > would be best of all. These error bounds could > be documented somewhere for posterity, at the > least. > > -- > G. Jungman > > > On 03/24/2016 12:22 PM, Patrick Alken wrote: >> Pavel, >> >> The Bessel function results look good. If everything is tested you >> can >> merge it all into the master branch. I'm cc'ing gsl-discuss so that this >> discussion will start to be archived. >> >> Regarding the error estimates, an attempt was made early on in GSL to >> provide error estimates for all the special functions. From section 7.2 >> of the manual: >> >> ---- >> The field val contains the value and the field err contains an estimate >> of the absolute error in the value. >> ---- >> >> I confess I don't fully understand all the error estimation, or even how >> accurate the estimates are. I don't know if any studies were done to >> verify the accuracy of these error estimates. One of the original GSL >> developers (Gerard Jungman) wrote a lot of this code (I've cc'd him on >> this email) - maybe he can help shed light on this. >> >> Since you've already done exhaustive comparisons of your new Bessel >> functions against arbitrary precision libraries, maybe you could simply >> change the error estimate to be: >> >> err = factor * GSL_DBL_EPSILON >> >> where factor are the numbers you've found in your error analysis. This >> way the calculation would be very fast (and probably more accurate than >> the current error estimation). >> >> We can wait and see if Gerard has any suggestions on this too. >> >> Gerard - since you haven't been following this email chain, Pavel has >> found more accurate Chebyshev expansions for the Bessel K0/K1 functions >> which have lower errors than the original GSL implementations when >> compared with arbitrary precision libraries. Through lots of random >> testing he has found upper limits on the errors of each function given >> by some positive constant times GSL_DBL_EPSILON. We are wondering if its >> safe to change the error estimation of these functions (I don't know the >> origin of the current calculations for these error estimates). >> >> Patrick > ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: GSL K0/K1 2016-03-25 21:18 ` Patrick Alken @ 2016-03-26 5:43 ` Pavel Holoborodko 0 siblings, 0 replies; 7+ messages in thread From: Pavel Holoborodko @ 2016-03-26 5:43 UTC (permalink / raw) To: Patrick Alken; +Cc: gsl-discuss Dear Patrick and Gerard, Let me comment on the situation with K0/1 as I see some confusion. A. The main issue was NOT the incorrectly rounded coefficients (it was also contributing to overall accuracy loss but wasn't the main issue). The issues with original SLATEC/GSL computations of K1/0 (in order of increasing importance): 1. (Minor) Incorrectly rounded coefficients for Chebyshev expansion. All Bessel routines suffer from the issue - the most affected is I0/1. 2. (Moderate) Cancellation near x=2 because -log(x/2) was computed as -lx+M_LN2 (subtraction of near-equal numbers near x=2). Fix for the issue improved accuracy by two times near x=2 - from 10.52eps to 5.11eps. 3. (Severe). In general, Chebyshev expansion for K0/1 on [0,2] is unstable when used in finite precision floating-point arithmetic. It suffers from cancellation error - tries to compute small values by subtracting large ones. Theoretically (when we ignore floating-point effects) it gives high accuracy. But it is quite disastrous when it comes to practical finite precision arithmetic. As a result, we had accuracy loss on first interval of approximation used in SLATEC/GSL for K0: [0,2] ~ 10.52 eps [2,8] ~ 1.25 eps (8,inf] ~ 1.25 eps In order to resolve the third issue, I derived rational approximation for [0,1] and new Chebyshev expansion for [1,8]. Rational approximation was built taking into account not only theoretical approximation order but also with minimization of effects of floating-point. I think this is very important design constrain which was overlooked in original SLATEC, etc. Theoretical bounds is informative but should be considered only together with practical limitations of finite precision arithmetic. Now we have for K0: [0,1] ~ 2.04 eps (rational approximation) [1,8] ~ 1.28 eps (new Chebyshev) (8,inf] ~ 1.25 eps (original Chebyshev from SLATEC) Results for K1 is similar. Interestingly, other software based on SLATEC code (including MATLAB) suffer from the same accuracy loss. I published some plots here: http://goo.gl/KUodAu B. Probably the real-time error analysis was needed when no uniform floating-point standard existed. I think we can safely drop this thanks to 100% widespread of IEEE 754. However I think error bounds should be computed based on uniform sampling in real conditions for the reasons above. Also it is very difficult to take into account all rounding effects coming from arithmetic operations, elementary functions, etc. included in expressions. Testing against extended precision combines everything automatically and gives meaningful error bound (for floating-point arithmetic of IEEE 754 standard). C. I0/1 have decisive influence of K0/1 near zero. For example: K0(x) = -(log(x/2)+g)*I0(x)+... In the same time I0/1 have power series with extremely fast convergency (+all terms are positive, no cancellation). Thus it has sense to use I0/1 for computing the K0/1 on [0,1]. In fact it is probably the optimal way. D. Accuracy of I0/1 in GSL. Chebyshev expansions are excellent for I0/1. The only issue here is incorrectly rounded coefficients of expansion - this increases relative error by ~2 times. I am still studying the code and there is a chance that rational approximations might give better accuracy when x->inf. Pavel. On Sat, Mar 26, 2016 at 6:18 AM, Patrick Alken <alken@colorado.edu> wrote: > Gerard, > > Yes sorry for including you so late in the discussion! Pavel (cc'd) found > that some of the GSL Chebyshev coefficients copied over from SLATEC were > rounded incorrectly (ie: just the very last digit or two was off which > caused quite a large error near x=2 for K0 and K1). > > The original GSL / SLATEC implementations broke up the K0/K1 interval in 3 > pieces: > [0,2] > [2,8] > [8,inf] > > Pavel developed new approximations using the intervals: > > [0,1] (new rational approximation from Pavel) > [1,8] (new Chebyshev from Pavel) > [8,inf] (original GSL Chebyshev) > > which avoids the troublesome point x=2 and has an overall lower error when > compared with arbitrary precision libraries over many random points. > > In principle we could simply correct the SLATEC rounding errors, but Pavel's > new method has even lower error than the corrected Chebyshev series. > > During some exhaustive numerical testing with random values, Pavel found > fairly good upper limits to the error on the K0/K1 on each of the 3 > intervals, which is why we think we should change the error estimate to the > empirical values found through the numerical simulations. > > Pavel found similar problems in I0/I1 which he is now working on correcting. > > Patrick > > On 03/25/2016 02:03 PM, Gerard Jungman wrote: >> >> >> As you say, this is the first I have seen of this. >> The results look very good. >> >> I briefly compared the code to the old version. >> It looks like you changed the domains for some >> of the expansions as well as the coefficients >> and some of the fractional transformations. >> So that's a lot of differences to track. >> >> Just glancing at the differences, I cannot tell >> why it is so much better. Do you know exactly >> what happened? >> >> The main problem seemed to be for x in (1,2). >> The big break in the 1/x plots at x=2 suggests >> that there was a problem in the coefficients >> in that domain. Chebyshev expansions should >> not behave that way; they should provide >> essentially uniform error over the domain. >> >> Was it some kind of typo in the tables? >> That's what it looks like to me. But >> maybe not... in fact I don't see how >> that would have happened. >> >> Maybe it was something more subtle. The basic method, >> of course, goes back to SLATEC, so I suppose that >> version has the same problems...? >> >> >> We can and should have a discussion about error estimates >> in general. But I will create another thread for that, so >> as not to expand or hijack this topic. >> >> >> As for the method and error estimates for K0 and K1, I am >> trying to understand myself what is happening there. Most >> of it is straightforward; the only problem is the domain >> x <= 2.0. I wrote this code about 20 years ago and >> haven't looked at it since. The assumption was that >> the original SLATEC was basically best possible, so >> it did not get a lot of scrutiny. >> >> I looked at the SLATEC source. In the code I refer to "bk0.f", >> although it seems to actually be called "besk0.f", so maybe >> there have been further changes to SLATEC since then? >> >> Anyway, SLATEC uses this method where it evaluates I0 for >> some reason. So I do the same. I have no idea why they do >> this. Maybe there is an original paper somewhere that >> explains it. Clearly it is not necessary, and it's >> much cleaner to avoid this dependency as well. >> If the expansion was correct in the old code, >> then the error must have been creeping in >> through the evaluation of I0. >> >> Has anybody checked I0? If it is also messed up, >> then maybe we have found the culprit. >> >> The error estimate in that section is just an application >> of some derivative estimates for that particular expression. >> But why that expression is used is entirely a mystery to me. >> >> >> A brief comment on the random testing: This is a good way >> to estimate the coefficient in the uniform error bound. >> But in principle it should not be necessary. There must >> be rigorous bounds for these expansions based on the >> uniformity of the Chebyshev expansions. Can the >> necessary coefficients be evaluated directly as >> part of the computation of the expansions? That >> would be best of all. These error bounds could >> be documented somewhere for posterity, at the >> least. >> >> -- >> G. Jungman >> >> >> >> On 03/24/2016 12:22 PM, Patrick Alken wrote: >>> >>> Pavel, >>> >>> The Bessel function results look good. If everything is tested you can >>> merge it all into the master branch. I'm cc'ing gsl-discuss so that this >>> discussion will start to be archived. >>> >>> Regarding the error estimates, an attempt was made early on in GSL to >>> provide error estimates for all the special functions. From section 7.2 >>> of the manual: >>> >>> ---- >>> The field val contains the value and the field err contains an estimate >>> of the absolute error in the value. >>> ---- >>> >>> I confess I don't fully understand all the error estimation, or even how >>> accurate the estimates are. I don't know if any studies were done to >>> verify the accuracy of these error estimates. One of the original GSL >>> developers (Gerard Jungman) wrote a lot of this code (I've cc'd him on >>> this email) - maybe he can help shed light on this. >>> >>> Since you've already done exhaustive comparisons of your new Bessel >>> functions against arbitrary precision libraries, maybe you could simply >>> change the error estimate to be: >>> >>> err = factor * GSL_DBL_EPSILON >>> >>> where factor are the numbers you've found in your error analysis. This >>> way the calculation would be very fast (and probably more accurate than >>> the current error estimation). >>> >>> We can wait and see if Gerard has any suggestions on this too. >>> >>> Gerard - since you haven't been following this email chain, Pavel has >>> found more accurate Chebyshev expansions for the Bessel K0/K1 functions >>> which have lower errors than the original GSL implementations when >>> compared with arbitrary precision libraries. Through lots of random >>> testing he has found upper limits on the errors of each function given >>> by some positive constant times GSL_DBL_EPSILON. We are wondering if its >>> safe to change the error estimation of these functions (I don't know the >>> origin of the current calculations for these error estimates). >>> >>> Patrick >> >> > ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: GSL K0/K1 2016-03-25 20:03 ` Gerard Jungman 2016-03-25 21:18 ` Patrick Alken @ 2016-03-25 22:20 ` Brian Gladman 1 sibling, 0 replies; 7+ messages in thread From: Brian Gladman @ 2016-03-25 22:20 UTC (permalink / raw) To: gsl-discuss On 25/03/2016 20:03, Gerard Jungman wrote: > > As you say, this is the first I have seen of this. > The results look very good. > > I briefly compared the code to the old version. > It looks like you changed the domains for some > of the expansions as well as the coefficients > and some of the fractional transformations. > So that's a lot of differences to track. > > Just glancing at the differences, I cannot tell > why it is so much better. Do you know exactly > what happened? > > The main problem seemed to be for x in (1,2). > The big break in the 1/x plots at x=2 suggests > that there was a problem in the coefficients > in that domain. Chebyshev expansions should > not behave that way; they should provide > essentially uniform error over the domain. > > Was it some kind of typo in the tables? > That's what it looks like to me. But > maybe not... in fact I don't see how > that would have happened. The coefficients were not correctly rounded and this caused larger than expected relative errors. There is also a logarithmic singularity at x = 0 that was computed in a way that led to some loss of precision through cancellation. > As for the method and error estimates for K0 and K1, I am > trying to understand myself what is happening there. Most > of it is straightforward; the only problem is the domain > x <= 2.0. I wrote this code about 20 years ago and > haven't looked at it since. The assumption was that > the original SLATEC was basically best possible, so > it did not get a lot of scrutiny. > > I looked at the SLATEC source. In the code I refer to "bk0.f", > although it seems to actually be called "besk0.f", so maybe > there have been further changes to SLATEC since then? > > Anyway, SLATEC uses this method where it evaluates I0 for > some reason. So I do the same. I have no idea why they do > this. For low x the modified Bessel function of the second kind for order 0 is evaluated using the definition: K0(x) = -(log(x/2) + gamma)I0(x) + polynomial(x^2) where I0(x) is the modified function of the first kind. In GSL the polynomial component is developed as the chebyshev series expansion under discussion. Maybe there is an original paper somewhere that > explains it. Clearly it is not necessary, and it's > much cleaner to avoid this dependency as well. > If the expansion was correct in the old code, > then the error must have been creeping in > through the evaluation of I0. The dependence of Kn(x) on In(x) for low x is not avoidable because the functions of the second kind have a logarithmic sigularity at x = 0. And, as you surmise, additional loss of precision was creeping in because the expansion coefficients for I0(x) were also incorrectly rounded. Brian Gladman ^ permalink raw reply [flat|nested] 7+ messages in thread
* error estimates (was: GSL K0/K1) 2016-03-24 18:22 ` GSL K0/K1 Patrick Alken 2016-03-25 20:03 ` Gerard Jungman @ 2016-03-25 21:24 ` Gerard Jungman 2016-03-25 21:37 ` error estimates Gerard Jungman 1 sibling, 1 reply; 7+ messages in thread From: Gerard Jungman @ 2016-03-25 21:24 UTC (permalink / raw) To: gsl-discuss On the question of error estimates. I have addressed some of this before on this list, although in more general contexts. Here is what I have to say about error estimates in particular. Warning: This is probably going to be a long one, like many of my cries of despair on this list... First, some agreement on the behavior of function evaluations. The best possible behavior for an evaluation is to conform to the following specification: The function f(x) must return a result, for a machine represented number x, which is the closest possible machine representable value to the mathematically correct value, correctly rounded. For something like the platform-provided sin(x) and cos(x), this is definitely the specification. Of course, some implementations fall short. There are examples, and associated discussions to be googled... https://randomascii.wordpress.com/2014/10/09/intel-underestimates-error-bounds-by-1-3-quintillion/ I have not checked this one myself. But you get the idea. I hope this one is not really true... anybody want to check? If a function conforms to the best-possible specification, then there is no need for an error estimate. The estimate is implicit in the contract. But this is hard enough to achieve, even for sin(x). If we stuck to that, there might no functions at all. So we sacrifice consistency and quality for coverage. The overriding problem with the special functions is the wide range of quality. Some of the functions are pretty good (although maybe not as good as hoped, i.e. K0/K1). Some are kind of difficult and lose in corner cases. And some are just plain nuts. Amongst the "just plain nuts" examples, the confluent hypergeometrics stick out. The error estimates are a kind of apology for this state of affairs, but they are obviously not very desirable either. Here is a quote from a message that I sent to this list several years ago: "This turns out to be a poor apology, since it tends to gum-up the works for the whole sublibrary, eating performance and occupying a large piece of intellectual real estate. The error estimation code must, at the least, be factored out. More to the point, it should likely be discarded in the main line of development. Other notions of error control should be investigated." So I think we are in agreement about what is desirable. But I have only the vaguest notions of how to get there. Here is a further quote from the same message: "Similar to the notion of sublibrary dependence discussed elsewhere, the special functions should be hierarchically organized. The dependencies should be made clear, and foundational levels of ironclad functions should be made explicit. Such ironclad functions should occupy the same place in the users mind as platform-native implementations of sin(x); they should be beyond question for daily use. Other functions, which suffer from implementation problems or are simply too complicated to guarantee the same level of correctness should be explicitly identified, as part of the design (not just the documentation!)." For example, functions which are ironclad should not bother to do error estimates. Functions which are harder to control should do something to indicate how they are behaving, whether this means returning an error estimate, or just returning some kind of "loss of accuracy" flag. Of course, the heuristics needed to identify loss of accuracy might be just as expensive as or equivalent to an actual error estimate. Functions which are "just plain nuts" might be pushed into an "experimental" sublibrary. Along with the problems involving variable quality, there is some actual intellectual confusion in this area as well. Examples of this are the versions of trig functions which return error estimates. Not only should these be among the ironclad functions (and probably not even provided by the library... what is the point), but the notion of "error estimate" expressed there is just confused. What those functions are trying to do is detect and report some form of ill-conditioning in the arguments, related to argument reduction. This is completely different from the theology expressed in the best-possible specification above. In that specification, the argument x is a known machine represented number, so there is no question of its conditioning. I remember writing these trig functions and puzzling over what I was doing. I was trying to fix some instability problem in the testing of functions which depended on trig evaluation; the ill-conditioning for large arguments was making it hard to create stable tests. But this was simply a problem with the tests and not with the functions. That is one day that I should have just sat on my hands and typed nothing; we would have been better off in the long run. I think the problem of argument conditioning is fundamentally a client-side issue. We should probably do nothing about it. If the client wants to evaluate sin(1.0e300), then it is probably not our problem. We should just do our best to return f(x) for the given machine-represented x. Ok, back to the real problem of error estimation and variable quality. What constitutes an ironclad function? In essence any function of a single variable (with no other parameters) must be ironclad. At the very least, it can be expressed with some form of Chebyshev expansion, argument transformation, etc, which should meet the best-possible specification. In other cases, there is an efficient high-quality algorithm which makes the evaluation easy, like the Lanczos algorithm for gamma(z). This includes functions of a single complex variable which it should be possible to treat in the same manner. What about functions with one "parameter", like besselK(nu, x)? These are harder and tend to have problems with corner cases or large arguments. But many of these should also be ironclad. Most of them have well-known stable methods, and the implementation only needs to take some care to get everything right. What about besselJ(nu, x)? This should also be ironclad. Although it may have some problems with argument conditioning, which can arise for many oscillatory functions, we have decided that argument-conditioning is not a problem we can address. So it should be no harder than sin(x), in principle. Stable iterative methods are applicable in many cases. Continued fraction methods are good, and I use these in many places. But they can also fail mysteriously. Gautschi has written some classic papers on this subject. Asymptotic methods are needed to fill the gaps. The tradeoff between iteration and asymptotic expansion has not been studied well enough. Certainly not by me. It would be best if the tradeoff point were detected automatically in some way, by the implementation. Otherwise, it becomes harder to understand what the code is doing, with many hard-wired constants, etc, that control the switch on methods. These matching problems are really tedious to think about explicitly; the machine should be tasked to figure that out. What about functions with more parameters? First the good news. All the elliptic functions and related elementary functions should be ironclad. A great deal is known about the evaluation of these. At the very least, you can always fall back on the arithmetic-geometric mean, which is stable and correct, though maybe not always the most efficient. Now the bad news: hypergeometric functions, including the regular and confluent functions. These are terrible. Probably all must go to the "experimental" ghetto. Functions with more parameters are harder to test as well. Maybe even impossible to test. Certainly, blind testing seems useless. You have to know what errors to look for. Another area of doubtful intellectual content: the "mode" argument for some of the evaluations. At the time, I wanted to be able to decrease accuracy at runtime, for performance gain in return. The obvious thing to do was to control the order of Chebyshev expansions. Fewer coefficients should be faster right? Well, who knows. It was never really studied properly. And it was only applicable for functions which depend on underlying Chebyshev expansions. Probably just a bad idea. The main library type is 'double'; the functions should probably just run as close to double precision as possible. Architectures have changes a lot since then as well, so spending extra cycles on some small local computations is not a big deal. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: error estimates 2016-03-25 21:24 ` error estimates (was: GSL K0/K1) Gerard Jungman @ 2016-03-25 21:37 ` Gerard Jungman 0 siblings, 0 replies; 7+ messages in thread From: Gerard Jungman @ 2016-03-25 21:37 UTC (permalink / raw) To: gsl-discuss Sorry, I accidentally hit the send button and the message was cut off. Oi. Anyway, that's basically all I had to say, at least for now. There are obviously some very difficult questions about how to organize the code, factor things properly, and create directed and useful tests. In my dreams, I would like to see a code base that not only burps out function values on request but also makes itself useful in other ways. Often clients want arrays of values, for sequences of parameters, such as 'ell' for Legendre functions or 'nu' for Bessel functions Such arrays are always computed using some iterative procedure, which is essentially the same procedure as used to compute single values. The library already has some "_array" functions that do some of these. But the best possible thing is to expose the actual recursion to the client, so that they can manipulate it and get what they want. This would also simplify the design of the underlying library code, which could reuse the same facilities. There are similar design questions about computing zeros of functions. Almost always needed in sets of many, and also closely related to other problems of interest to the client. Maybe one day, all this stuff could be made better. -- G. Jungman ^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2016-03-26 5:43 UTC | newest] Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- [not found] <56F42FED.8060500@colorado.edu> 2016-03-24 18:22 ` GSL K0/K1 Patrick Alken 2016-03-25 20:03 ` Gerard Jungman 2016-03-25 21:18 ` Patrick Alken 2016-03-26 5:43 ` Pavel Holoborodko 2016-03-25 22:20 ` Brian Gladman 2016-03-25 21:24 ` error estimates (was: GSL K0/K1) Gerard Jungman 2016-03-25 21:37 ` error estimates Gerard Jungman
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).