[WIP] Add Tutorial and Derivations Notebooks for VALMOD #585 #586

NimaSarajpoor · 2022-04-06T11:19:24Z

This notebook addresses issue #585. In this notebook, we would like to implement the VALMOD method proposed in VALMOD_2018 and VALMOD_2020.

What I have done so far:

Provide introduction that provides gist of concept proposed in the paper
Calculate Lower-Bound distance profile after correcting the typo in eq(1) of paper....and verify the calculation with a np.random.uniform time series data.

For now, I calculated LB given q>0 (see eq(2) in paper.) However, we still need to find LB when q <= 0.

review-notebook-app · 2022-04-06T11:19:28Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

NimaSarajpoor · 2022-04-06T11:22:57Z

@seanlaw
Please allow me some time to see if I can calculate LB for q<=0 (see eq(2) of paper). I will let you know when I am done...

NimaSarajpoor · 2022-04-06T17:35:45Z

I had a miscalculation. Although there is a typo in the paper, it seems the eq(2) of paper is correct. I fixed the typo of paper when I was doing calculation. However, I had a miscalculation somewhere else...so I corrected it and I got the eq(2)... for q>0. I will fix the notebook. (so, I assume the equation should be correct for case q<=0 as well).

codecov-commenter · 2022-04-06T23:49:57Z

Codecov Report

Patch and project coverage have no change.

Comparison is base (275b998) 99.24% compared to head (f6126ca) 99.24%.

📣 This organization is not using Codecov’s GitHub App Integration. We recommend you install it so Codecov can continue to function properly for your repositories. Learn more

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #586   +/-   ##
=======================================
  Coverage   99.24%   99.24%           
=======================================
  Files          82       82           
  Lines       12956    12956           
=======================================
  Hits        12858    12858           
  Misses         98       98

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

NimaSarajpoor · 2022-04-07T00:01:36Z

@seanlaw

Notebook is ready. The notebook covers the first 12 pages of VALMOD_2020 paper. I FIXed my miscalculation and things are good! I also implemented the Low-Bound distance profile function for now to see how it performs (we may use it later in VALMOD algorithm).

NimaSarajpoor · 2022-04-09T21:16:26Z

Just wanted to let you know that you can ignore the function '_calc_LB_dist_profile' at the end of notebook (it is working..but I think it is not clean. I may probably remove it as VALMOD algorithm does not use such function. I just created it to get Lower-Bound of distance profile for now to show the result)

seanlaw · 2022-04-09T23:50:07Z

I will first need to go over the initial 12 pages myself and then I will review the notebook :)

seanlaw · 2022-04-10T15:07:07Z

@NimaSarajpoor I've gone over your notebook quickly but haven't verified the derivation. Usually, with derivations, I like to write things out fully without skipping any steps (see https://github.com/TDAmeritrade/stumpy/blob/main/docs/Matrix_Profile_Derivation.ipynb). Some of your equations don't seem to be rendering for me and it's a bit hard for me to follow. I can try to find some time to work through the derivation to verify your work if that's helpful?

NimaSarajpoor · 2022-04-10T17:16:57Z

I see. Please let me re-write it. I will try to follow the same approach/style you used in the link you provided. I will check ReviewNB and if it is rendered well, I will let you know. Sounds good?
(Btw, is it necessary to provide the derivation? I did that because of the typo in the paper, but then I realized the result is the same as what provided in eq(2) for q>0)

seanlaw · 2022-04-10T17:26:50Z

Yes, that would be great!

Personally, I think writing out the derivation clearly will help (me) and others reduce any doubt in understanding. Also, I find that it provides an opportunity to help maintain consistency in the code regarding variable names.

NimaSarajpoor · 2022-04-11T04:42:01Z

Weird...still not rendering well.... please let me do some investigation on my end to see what's going on...

…ing issue

NimaSarajpoor · 2022-04-11T07:13:43Z

@seanlaw
So, I tried the Welford_Review/ Matrix_Profile_Derivation/ Pearson notebooks to see if ReviewerNB on Github can render them. Unfortunately, it seems that it cannot render them properly for the first two notebooks. Peason notebook is rendered properly though!

I guess you wrote the notebooks on your end and pushed them to the repo...and things were good when rendered locally in .ipynb. Did you, by any chance, try to check your notebooks via ReviewerNB of Github?

It seems the problem is related to the ReviewerNB of Github. I enclosed the math equations with $$ and it seems the problem is resolved. Well, almost resolved! I checked out the ReviewNB here and it seems there is still one error. I pushed the same notebook to a test_repo I created and it seems that single error does not appear when I checked it out with ReviewerNB.

my_test_repo

NimaSarajpoor · 2022-04-11T07:29:33Z

@seanlaw
Just for the records:
if I click on the ReviewNB purple button (on top of this PR page), it seems there is still one error related to rendering. However, when I clicked on ReviewNB blue hyperlink (on top of this PR page just below the purple button), and navigated to the notebook from my fork of STUMPY, everything seems to be fine and there is no error in rendering....

STUMPY_my_fork

seanlaw · 2022-04-11T12:21:18Z

Sounds good

docs/Tutorial_VALMOD.ipynb

seanlaw · 2022-04-11T13:11:53Z

Apologies, these comments are for an older commit. I forgot to hit "Finish Review" along with my last comment.

docs/Tutorial_VALMOD.ipynb

seanlaw · 2022-04-11T14:11:50Z

@NimaSarajpoor I provided some comments and stopped at the "Expanding (3)" line

NimaSarajpoor · 2022-04-11T16:24:21Z

@seanlaw
Thanks for the comments!

Apologies, these comments are for an older commit. I forgot to hit "Finish Review" along with my last comment.

I found two of those comments in the ReviewNB. Maybe they got mixed together (?). I will address those two comments and then I ignore/resolve the rest. Please let me know if I miss anything.

NimaSarajpoor · 2022-04-12T03:30:42Z

I think we are all set. I can push commits after revising the notebook.

NimaSarajpoor · 2022-04-25T07:35:50Z

@seanlaw
please free to review.

Improved section 2: Lower Bound of Distance Profile
Added section 3: Core idea to briefly explain VALMOD
Added section 4: VALMOD algorithm. ( Implemented algorithm3 of paper)

docs/Tutorial_VALMOD.ipynb

- Fix typos - replace variable name n with k

NimaSarajpoor · 2022-04-27T01:00:04Z

@seanlaw

We basically need to spend some time figuring out how to allow stumpy.stump , stumpy.stumped , and stumpy.gpu_stump return top-k nearest neighbors.

So, should I now go and study stump/stumped/gpu_stump? And, then try to change all of them to return top-k nearest neighbors?

seanlaw · 2022-04-27T01:10:25Z

@NimaSarajpoor Yes, I also added a new issue #592 where we can discuss it in more detail

NimaSarajpoor · 2022-06-22T20:28:46Z

A couple of notes that are confirmed by the main author of VALMOD:

On Page 13, in Algorithm 2: update VALMP, line 3 should have been: if VALMP.normdistances[i] > lNormDist or ...
On page 16, the Algorithm 4: ComputeSubMP can be further optimized by updating minDistABS in the for-loop (line 28)

(This is to make sure that we do not lose this information later)

seanlaw · 2023-01-31T15:20:45Z

So, should I now go and study stump/stumped/gpu_stump? And, then try to change all of them to return top-k nearest neighbors?

We've come along way @NimaSarajpoor! I wonder how easy/hard it would be to implement VALMOD now that we have top-k nearest neighbors?

NimaSarajpoor · 2023-02-01T05:40:18Z

@seanlaw

We've come along way

We have indeed!

I wonder how easy/hard it would be to implement VALMOD now that we have top-k nearest neighbors?

I took a quick look at the paper. I don't remember the details but I think the first four algorithms are the core ones. The first two algorithms are easy. The third one is already done (the top-k feature added to STUMPY). In my opinion, the main remaining task is algorithm 4. I think its implementation should be straightforward.

NimaSarajpoor · 2023-03-07T10:01:43Z

I think the algorithm presented in a paper has a flaw. (I sent an email to the main author and am waiting for his response.)

In the page14 of the paper, the following can be read:

Algorithm 2 shows the routine to update the VALMP structure. The final VALMP consists of four parts. The i the entry of the normDistances vector stores the smallest length-normalized Euclidean distance values between the ith subsequence and its nearest neighbor, while the ith place of vector distances stores their straight Euclidean distance. The location of each subsequence’s nearest neighbor is stored in the vector indices. The structure lengths contains the length of the i th subsequences pair.

Let's assume P denotes the variable-length matrix profile obtained by VALMOD. According to my investigation P cannot be exact. Athough min(P) is exact, P itself is not exact. (see algorithm 4, line 29)

It is also possible that the author is aware of this, and considered it in other algorithms provided in the paper, like the ones presented for discovering motifs / discords. In other words, the aforementioned paragraph might just be written badly.

NimaSarajpoor · 2026-01-03T05:22:59Z

I think the algorithm presented in a paper has a flaw. (I sent an email to the main author and am waiting for his response.)

Athough min(P) is exact, P itself is not exact. (see algorithm 4, line 29)

It is also possible that the author is aware of this

The author, Michelle Linardi, confirmed that. The exact quote is provided below:

VALMP is guaranteed to contain just the K smallest true distances , where k >= 1. As you pointed out, it can contain approximate distances, but never the one of the best motif that is theoretically guaranteed to be true.

As a side:
I've started reviewing this work. Noticed the tutorial notebook is not easy to follow. Improved the story, and added an example to help reader understand the LowerBound (LB) in VALMOD. Will continue the review.

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


the authors meant to avoid considering two subsequences (of different length) that start from the same index.

Add reference please

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Remove "Note"

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Maybe breakdown this sentence as it was not easy to digest in the first attempt.

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Add the missing "Z-"

"What about Z-normalized euclidean distance?"

===

$..LB$, where the LowerBound is .......

===

If we follow the notation in the paper, this lower bound can be...

Instead, simply say: "The paper uses the notation .... However, here, we prefer to add subscript m to show the subsequence length based on which the LB is calculated.

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


No need to mention this formula as I think we are not going to use it later (and when we do, we can mention it then). It is better to replace it with a formula that shows the relationship between rho and z-normalized distance.

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


The "THEN" part is huge. Better to provide the "WHAT" part, and then the "HOW"/"WHY" part (the proof)

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Maybe add a note that the VALMOD's output is NOT the same as naive_VALMOD because, IIRC, VALMOD provides approximate matrix profile that has exact value for discord(s)

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Line #41. # clipping ρ
Remove unnecessary comment. Instead, maybe add comment above Line #40:

# Compute LowerBound (LB) factor

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Line #45. is_mp_valid[:] = True
Do we need to include this output inside function? This is always all True. Maybe we should use it outside of this function, maybe in the caller function.

Reply via ReviewNB

NimaSarajpoor · 2026-01-06T00:53:32Z

docs/Tutorial_VALMOD.ipynb

@@ -0,0 +1,1056 @@
+{


Line #50. def _VALMOD_stump_partial(T, m, k, LB_σr, LB_I):
This function is very hard to follow. Start with the original algorithm (see Algorithm 4), and see if it can be broken down to smaller pieces.

Reply via ReviewNB

Add Tutorial Notebook for VALMOD

1f409f9

Fix calculation and implement Lower-Bound distance profile

53774a5

Merge branch 'main' into Tutorial_VALMOD

5b5216c

NimaSarajpoor added 2 commits April 10, 2022 17:21

improve markdowns

ca04fae

Major Revise of Notebook

c6b0868

NimaSarajpoor added 2 commits April 11, 2022 00:37

Use 2662 to enclose math equations written in latex to resolve render…

d21061a

…ing issue

modify math eq latex in markdown

88503ed

seanlaw reviewed Apr 11, 2022

View reviewed changes

NimaSarajpoor added 3 commits April 11, 2022 22:49

add twin_freak explanation

1a87295

add lower-bound calculation for non-normalized p-norm distance

00377bd

major improvement in derivation of lower-bound

f38e2e5

NimaSarajpoor added 7 commits April 24, 2022 22:02

Elaborate sections 2,3,4

ab36c8a

Improve readability

4ae4d26

Correct grammer and typo

450ddc6

Merge branch 'main' into Tutorial_VALMOD

1ff9947

proof read

15e5a14

Removed unrecognized latex code

c8653e2

minor changes

d924fc1

seanlaw reviewed Apr 26, 2022

View reviewed changes

docs/Tutorial_VALMOD.ipynb Show resolved Hide resolved

docs/Tutorial_VALMOD.ipynb Show resolved Hide resolved

docs/Tutorial_VALMOD.ipynb Show resolved Hide resolved

NimaSarajpoor added 5 commits April 26, 2022 17:03

minor changes

df84ded

- Fix typos - replace variable name n with k

replace heapq with np.searchsorted

7d38e39

Fix typo

cbe649f

Test P and I of _VALMOD_stump

34edea8

minor changes

b4b9f69

NimaSarajpoor mentioned this pull request Apr 27, 2022

Add Top-K Nearest Neighbors to Matrix Profile (normalize=True) #592

Closed

NimaSarajpoor added 5 commits February 26, 2023 10:02

Merge branch 'main' into Tutorial_VALMOD

1cd81af

Implement VALMOD-draft version

c6dd58d

implement naive valmod and minor changes

ada4fdf

Merge branch 'main' into Tutorial_VALMOD

6280aa0

minor changes

f6126ca

NimaSarajpoor and others added 2 commits January 2, 2026 19:32

Merge branch 'stumpy-dev:main' into Tutorial_VALMOD

4254440

Enhance Tutorial for VALMOD

764ec9d

NimaSarajpoor commented Jan 6, 2026

View reviewed changes

[WIP] Add Tutorial and Derivations Notebooks for VALMOD #585 #586

Are you sure you want to change the base?

[WIP] Add Tutorial and Derivations Notebooks for VALMOD #585 #586

Uh oh!

Conversation

NimaSarajpoor commented Apr 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Apr 6, 2022

Uh oh!

NimaSarajpoor commented Apr 6, 2022

Uh oh!

NimaSarajpoor commented Apr 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Apr 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

NimaSarajpoor commented Apr 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NimaSarajpoor commented Apr 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seanlaw commented Apr 9, 2022

Uh oh!

seanlaw commented Apr 10, 2022

Uh oh!

NimaSarajpoor commented Apr 10, 2022

Uh oh!

seanlaw commented Apr 10, 2022

Uh oh!

NimaSarajpoor commented Apr 11, 2022

Uh oh!

NimaSarajpoor commented Apr 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NimaSarajpoor commented Apr 11, 2022

Uh oh!

seanlaw commented Apr 11, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

seanlaw commented Apr 11, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

seanlaw commented Apr 11, 2022

Uh oh!

NimaSarajpoor commented Apr 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NimaSarajpoor commented Apr 12, 2022

Uh oh!

NimaSarajpoor commented Apr 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NimaSarajpoor commented Apr 27, 2022

Uh oh!

seanlaw commented Apr 27, 2022

Uh oh!

NimaSarajpoor commented Jun 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seanlaw commented Jan 31, 2023

Uh oh!

NimaSarajpoor commented Feb 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

NimaSarajpoor commented Apr 6, 2022 •

edited

Loading

NimaSarajpoor commented Apr 6, 2022 •

edited

Loading

codecov-commenter commented Apr 6, 2022 •

edited

Loading

NimaSarajpoor commented Apr 7, 2022 •

edited

Loading

NimaSarajpoor commented Apr 9, 2022 •

edited

Loading

NimaSarajpoor commented Apr 11, 2022 •

edited

Loading

NimaSarajpoor commented Apr 11, 2022 •

edited

Loading

NimaSarajpoor commented Apr 25, 2022 •

edited

Loading

NimaSarajpoor commented Jun 22, 2022 •

edited

Loading

NimaSarajpoor commented Feb 1, 2023 •

edited

Loading

NimaSarajpoor commented Mar 7, 2023 •

edited by seanlaw

Loading

NimaSarajpoor commented Jan 3, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading

NimaSarajpoor Jan 6, 2026 •

edited

Loading