Feature #2148

GSkyDir does not cache ra,dec or l,b values when they are computed

Added by Cardenzana Josh almost 7 years ago. Updated almost 7 years ago.

Status:ClosedStart date:07/04/2017
Priority:NormalDue date:
Assigned To:Cardenzana Josh% Done:

100%

Category:-
Target version:1.3.1
Duration:

Description

When a GSkyDir object is created, it must be passed either ra,dec or glon, glat (i.e. l,b) values. As a result, the other coordinates are not cached. Here is an example:

GSkyDir obs, dir;
obs.radec(0, 0);
dir.lb(0, 0);
double dist = obs.dist(dir);

Here, obs is created with celestial coordinates (as all observations do) and dir is created with galactic coordinates. When the 'dist()' is computed, dir has to have its coordinates converted to celestial before the cosine of the distance is computed. Here is the code snippet that does the computation:
        if (dir.m_has_radec) {
            #if defined(G_SINCOS_CACHE)
            if (!dir.m_has_radec_cache) {
                dir.m_has_radec_cache = true;
                dir.m_sin_dec         = std::sin(dir.m_dec);
                dir.m_cos_dec         = std::cos(dir.m_dec);
            }
            cosdis = m_sin_dec * dir.m_sin_dec +   // This section should be run from the
                     m_cos_dec * dir.m_cos_dec *   // second time this function is called onward
                     std::cos(dir.m_ra - m_ra);
            #else
            cosdis = std::sin(m_dec) * std::sin(dir.m_dec) +
                     std::cos(m_dec) * std::cos(dir.m_dec) *
                     std::cos(dir.m_ra - m_ra);
            #endif
        }
        else {
            #if defined(G_SINCOS_CACHE)
            cosdis = m_sin_dec * sin(dir.dec()) +   // This section is run EVERY time, but
                     m_cos_dec * cos(dir.dec()) *   // should only be run the first time
                     std::cos(dir.ra() - m_ra);
            #else
            cosdis = sin(m_dec) * sin(dir.dec()) +
                     cos(m_dec) * cos(dir.dec()) *
                     std::cos(dir.ra() - m_ra);
            #endif
        }

For each call to 'GSkyDir::dec()' and 'GSkyDir::ra()' the method has to run 'GSkyDir::gal2equ()' and take the cosine of the declination, both of which are expensive computations. In the above, that’s three calls to 'gal2equ()' and two calls to 'cos()'. For map cube computations, which loop over every bin for every observation, this becomes very expensive.

This is actually unnecessary as the data members 'm_ra’ and 'm_dec’ are filled when calling 'dir.ra()' or 'dir.dec()' the first time (similarly for galactic coordinates), so all that needs to be done is to update the call methods to set either 'm_has_radec’ or 'm_has_lb’ to true. This addition should be very low overhead, but potentially dramatic speedups in computation time when the bin sky directions are cached.


Recurrence

No recurrence.


Related issues

Related to ctools - Feature #2147: Reduce computation time of ctmodel and parallelize using ... Closed 07/04/2017

History

#1 Updated by Cardenzana Josh almost 7 years ago

  • Related to Feature #2147: Reduce computation time of ctmodel and parallelize using OpenMP. added

#2 Updated by Cardenzana Josh almost 7 years ago

  • Status changed from In Progress to Pull request
  • % Done changed from 0 to 100

After implementing the above change, the following time difference is produced in ctmodel running the following command:

ctmodel debug=yes inmodel=../1dc.south/models/model_bkg.xml inobs=../1dc.south/obs/obs_egal_baseline.xml 
incube=egalsrvy_01_datamap.fits outcube=test_ctmodel_2148.fits

This runs over the extragalactic survey data from the 1dc using a pre-generated cube (from ctbin) to define the bin information. The model cube is generated from only the background “CTAIrfBackground” model.

devel:

...
2017-07-04T13:31:07: +============+
2017-07-04T13:31:07: | Model cube |
2017-07-04T13:31:07: +============+
2017-07-04T13:31:07: === GCTAEventCube ===
2017-07-04T13:31:07:  Number of events ..........: 21746400
2017-07-04T13:31:07:  Number of elements ........: 67200000
2017-07-04T13:31:07:  Number of pixels ..........: 3360000
2017-07-04T13:31:07:  Number of energy bins .....: 20
2017-07-04T13:31:07:  Time interval .............: 59215.5 - 59256.8548611111 days
2017-07-04T13:31:07: === GEbounds ===
2017-07-04T13:31:07:  Number of intervals .......: 20
2017-07-04T13:31:07:  Energy range ..............: 100 GeV - 100 TeV
2017-07-04T13:31:07: === GSkyMap ===
2017-07-04T13:31:07:  Number of pixels ..........: 3360000
2017-07-04T13:31:07:  X axis dimension ..........: 2800
2017-07-04T13:31:07:  Y axis dimension ..........: 1200
2017-07-04T13:31:07:  Number of maps ............: 20
2017-07-04T13:31:07:  Shape of maps .............: (20)
...
2017-07-04T13:31:14: Application "ctmodel" terminated after 17168 wall clock seconds, consuming 16966.7 seconds 
of CPU time.

2148 branch:

...
2017-07-04T12:06:59: === GCTAEventCube ===
2017-07-04T12:06:59:  Number of events ..........: 21746400
2017-07-04T12:06:59:  Number of elements ........: 67200000
2017-07-04T12:06:59:  Number of pixels ..........: 3360000
2017-07-04T12:06:59:  Number of energy bins .....: 20
2017-07-04T12:06:59:  Time interval .............: 59215.5 - 59256.8548611111 days
2017-07-04T12:06:59: === GEbounds ===
2017-07-04T12:06:59:  Number of intervals .......: 20
2017-07-04T12:06:59:  Energy range ..............: 100 GeV - 100 TeV
2017-07-04T12:06:59: === GSkyMap ===
2017-07-04T12:06:59:  Number of pixels ..........: 3360000
2017-07-04T12:06:59:  X axis dimension ..........: 2800
2017-07-04T12:06:59:  Y axis dimension ..........: 1200
2017-07-04T12:06:59:  Number of maps ............: 20
2017-07-04T12:06:59:  Shape of maps .............: (20)
...
2017-07-04T12:07:07: Application "ctmodel" terminated after 6332 wall clock seconds, consuming 6284.84 seconds 
of CPU time.

This represents about 2.5x speed up with no difference in the produced model cubes. The biggest time computation in ctmodel in the above run is still in 'GSkyDist::dist()' at the end when computing 'gammalib::acos()' (which uses 'std::asin()'), followed closely by 'std::cos()'. However, I’m not sure there’s any way to abstract this out without reducing precision in the computation.

Pull branch:
josh cardenzana / gammalib: 2148-improve_gskydir_caching

#3 Updated by Knödlseder Jürgen almost 7 years ago

  • Target version changed from 1.4.0 to 1.3.1

I’m about to merge in the code. I moved the setting of the flags in the GSkyDir::gal2equ() and GSkyDir::equ2gal() methods since this simplifies the code and then also covers more methods.

#4 Updated by Knödlseder Jürgen almost 7 years ago

  • Status changed from Pull request to Closed

Merged into bugfix-1.3.1 branch. Will also merge into devel branch.

Also available in: Atom PDF