Refactoring wavelet organisation indices (woi) by martinjanssens · Pull Request #32 · cloudsci/cloudmetrics

martinjanssens · 2021-09-07T08:09:24Z

Begun refactoring this by using a single function that computes both the wavelet transformation and the metrics. We could make this more similar to the Fourier metrics or to the object metrics, as mentioned in #28 (comment), and we should probably make a decision on that before I continue with this.

leifdenby

Thanks for starting work on this!

leifdenby · 2021-09-08T09:51:51Z

cloudmetrics/metrics/woi.py

+    """
+    Computes the three Wavelet Organisation Indices WOI1, WOI2, WOI3 proposed by 
+    Brune et al. (2018) from the stationary/undecimated Direct Wavelet Transform
+    of a scalar field.


Could you include the DOI or URL to the paper (https://doi.org/10.1002/qj.3409)?

leifdenby · 2021-09-08T09:52:54Z

cloudmetrics/metrics/woi.py

+    # Compute wavelet coefficients
+    scale_max = int(np.log(cloud_scalar.shape[0]) / np.log(2))
+    coeffs = pywt.swt2(cloud_scalar, wavelet, scale_max, norm=True, trim_approx=True)
+    # Bug in pywt -> trim_approx=False does opposite of its intention


Did you log this on the pywt repo? Sounds like they might want to know that :)

leifdenby · 2021-09-08T09:53:33Z

cloudmetrics/metrics/woi.py

+    separation_scale : int, optional
+        Which power of 2 to use as a cutoff scale that separates 'small' scales
+        from 'large' scales. The default is 5; i.e. energy contained in scales
+        larger than 2^5=32 pixles is considered 'large-scale energy'.


cloudmetrics/metrics/woi.py

martinjanssens · 2021-09-09T16:24:57Z

Nice job finding that R-package, I don't think that was around when I wrote this script the first time! Consequently, my implementation is just an attempt to match what I could get from the original paper's text. At a glance, the two approaches have some methodological differences, though I don't think they're major. The R-package does handle non-periodic BCs very nicely by mirroring the fields, and chooses to zoom to an appropriate level (the field needs to be of a shape that is a power of 2), while we're padding periodically to get there. I think I'd like to include these two aspects, at least, so I'll try to rewrite accordingly. Also, do you have an opinion on whether we want this to be a single function that returns three indices or three functions returning one index each?

leifdenby · 2021-09-10T10:40:42Z

At a glance, the two approaches have some methodological differences, though I don't think they're major. The R-package does handle non-periodic BCs very nicely by mirroring the fields, and chooses to zoom to an appropriate level (the field needs to be of a shape that is a power of 2), while we're padding periodically to get there. I think I'd like to include these two aspects, at least, so I'll try to rewrite accordingly.

That sounds great!

Also, do you have an opinion on whether we want this to be a single function that returns three indices or three functions returning one index each?

I'm about unsure what to do here. It's quite nice to only have one number returned by default since it makes analysis further down the pipeline simpler. I would probably like to have a dataset with woi1, woi2, etc in it together with other variables I was studying. I can see two ways of dealing with this:

The convention in numpy seems to be return single numbers by default and then more information can be requested (for example the full argument in https://numpy.org/doc/stable/reference/generated/numpy.polyfit.html). Are one of the three woi coefficients generally more interesting/meaningful? If so we could return one of them and have parameter called coefficient with call options like .woi(cloud_mask, coefficient=1), .woi(cloud_mask, coefficient=2) and .woi(cloud_mask, coefficient='all').
Another option would be to have functions called woi1(...), woi2(...), woi3(...) following the pattern of object geometry properties the calculations could be cached.

What do you think?

…ans cached output from a single evaluation of the stationary wavelet transform

…ion?

martinjanssens · 2021-09-10T17:33:49Z

I'd be curious to hear what you think of this when you have time, it's basically trying to do your second option (I need to improve the tests still). :)

leifdenby · 2021-09-13T16:28:52Z

cloudmetrics/metrics/woi.py

+_CACHED_VALUES = dict()
+
+
+def _get_swt(cloud_scalar, pad_method, wavelet, separation_scale):


:) We can put this kind of thing into a decorator at some point. I would use functools.lru_cache but that doesn't work with just numpy arrays because they can't always be serialized.

leifdenby

This is simply awesome, nice work! I think we should get this merged in. Remember to add an entry to the changelog

Started refactoring woi

b18dc5e

martinjanssens mentioned this pull request Sep 7, 2021

Refactoring of cloud-metric routines #20

Open

16 tasks

martinjanssens changed the title ~~Started refactoring woi~~ Refactoring wavelet organisation indices (woi) Sep 7, 2021

leifdenby reviewed Sep 8, 2021

View reviewed changes

martinjanssens added 3 commits September 10, 2021 18:35

Separate functions for woi1, woi2 and woi3, each accessing the poor-m…

59cec71

…ans cached output from a single evaluation of the stationary wavelet transform

Simple test of SWT - do spectra output directionally correct informat…

de6e05d

…ion?

Apply black

be790c2

leifdenby reviewed Sep 13, 2021

View reviewed changes

leifdenby approved these changes Sep 13, 2021

View reviewed changes

martinjanssens added 3 commits September 14, 2021 10:52

Extended tests to test each woi component

0ff1737

Merged in refactored cloud_fraction

4e8d372

Updated changelog

c4200b0

martinjanssens merged commit c924528 into cloudsci:master Sep 14, 2021

martinjanssens mentioned this pull request Sep 14, 2021

Versioning and zenodo #35

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring wavelet organisation indices (woi)#32

Refactoring wavelet organisation indices (woi)#32
martinjanssens merged 7 commits intocloudsci:masterfrom
martinjanssens:refactor-woi

martinjanssens commented Sep 7, 2021

Uh oh!

leifdenby left a comment

Uh oh!

leifdenby Sep 8, 2021

Uh oh!

leifdenby Sep 8, 2021

Uh oh!

leifdenby Sep 8, 2021

Uh oh!

Uh oh!

martinjanssens commented Sep 9, 2021

Uh oh!

leifdenby commented Sep 10, 2021

Uh oh!

martinjanssens commented Sep 10, 2021

Uh oh!

leifdenby Sep 13, 2021

Uh oh!

leifdenby left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		_CACHED_VALUES = dict()


		def _get_swt(cloud_scalar, pad_method, wavelet, separation_scale):

Conversation

martinjanssens commented Sep 7, 2021

Uh oh!

leifdenby left a comment

Choose a reason for hiding this comment

Uh oh!

leifdenby Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

leifdenby Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

leifdenby Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

martinjanssens commented Sep 9, 2021

Uh oh!

leifdenby commented Sep 10, 2021

Uh oh!

martinjanssens commented Sep 10, 2021

Uh oh!

leifdenby Sep 13, 2021

Choose a reason for hiding this comment

Uh oh!

leifdenby left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants