Allow computing the empty transitive path #1800

RobinTF · 2025-02-12T15:02:19Z

It seems like the code was already there, so all this PR does is removing the check.

sparql-conformance · 2025-02-12T16:58:59Z

Conformance check passed ✅

Test Status Changes 📊

Number of Tests	Previous Status	Current Status
3	Failed	Passed

Details: https://qlever.cs.uni-freiburg.de/sparql-conformance-ui?cur=f21e5611708a6b9e2f2ce86f217792b016173c44&prev=2fad76532e66d45fb9fd7063c6c563daa0eeea42

codecov · 2025-02-12T17:34:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 90.04%. Comparing base (2fad765) to head (f21e561).
Report is 7 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1800      +/-   ##
==========================================
- Coverage   90.05%   90.04%   -0.01%     
==========================================
  Files         396      396              
  Lines       37928    37921       -7     
  Branches     4262     4260       -2     
==========================================
- Hits        34156    34146      -10     
- Misses       2477     2481       +4     
+ Partials     1295     1294       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

sonarqubecloud · 2025-02-12T18:42:08Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

hannahbast · 2025-02-12T19:34:33Z

@RobinTF Awesome that you are looking into this. Will this also remove false negative "This query might have to evaluate the empty path" exception, like for the following query?

PREFIX wd: <http://www.wikidata.org/entity/>
PREFIX wdt: <http://www.wikidata.org/prop/direct/>
SELECT * WHERE {
  wd:Q11629 wdt:P279/(wdt:P279* | wdt:P31*) | wdt:P31/(wdt:P279* | wdt:P31*) ?type .
}

https://qlever.cs.uni-freiburg.de/wikidata/Zb82ho

RobinTF · 2025-02-12T19:44:24Z

@hannahbast Depends on the query planning, I haven't tried any more complex queries. But this is the only "This query might have to evaluate the empty path" exception there is, so you definitely won't get this exact exception any more.
All I really did is analyse the code and I came to the conclusion that everything should just work if I remove the check, and my tests all did work as expected.

joka921

This needs a substantially differen implementation.

joka921 · 2025-02-12T20:25:09Z

src/engine/TransitivePathImpl.h

-    if (minDist_ == 0 && !isBoundOrId() && lhs_.isVariable() &&
-        rhs_.isVariable()) {
-      AD_THROW(
-          "This query might have to evaluate the empty path, which is "
-          "currently "
-          "not supported");
-    }


Unfortunately this is not quite correct.
The empty path must in principle contain everything, not only entities that are in some way contained in the Path with the *. This means, that we have to fix this in the TurtleParser.

hannahbast · 2025-02-12T20:33:27Z

@RobinTF OK, I just tried this on Wikidata and see what's happening. Let's take a simpler example query (prefixes omitted for brevity):

SELECT * WHERE {
  wd:Q11629 wdt:P31/(wdt:P279*|wdt:P31*) ?class
}

With this PR, the query no longer throws the "empty path" exception, but it also never finishes. The problem is that in the current implementation, QLever evaluates this just as it would evaluate the following query:

SELECT * WHERE {
  wd:Q11629 wdt:P31 ?tmp .
  { ?tmp wdt:P279* ?class } UNION { ?tmp wdt:P31* ?class }
}

But the result for each of the two { ... } is a table with one row per subject, for all subjects, that is of size 2.2 B for Wikidata.

That would eventually give the correct result but takes forever and is obviously not how the query should be evaluated.

RobinTF · 2025-02-12T20:48:08Z

@hannahbast Can you clarify? The issue you described seems just like poor query planning to me, where the query planner performs the join operation in a very inefficient position. If you replace the asterisk in your query with + you get the very same suboptimal query tree, the only difference is that it contains fewer results, so it doesn't crash. This doesn't seem to be an issue with the empty path if you ask me.

hannahbast · 2025-02-12T20:58:54Z

@RobinTF My point is the following:

Almost all of the real-world queries, for which QLever currently throws the "This query might have to evaluate the empty path" exception, are of the kind of my examples above.
Queries where you actually have to evaluate the empty path are very rare in practice. An simple example with a rather useless result would be SELECT * WHERE { ?x wdt:P31* ?y }. I have to think about a simple useful example.
Therefore, a PR that correctly evaluates the empty path but does not fix the query planning issue, would not make things better and rather worse (because queries that threw an exception before would then take forever without users understanding why).
I therefore think that the two issues (evaluating the empty path when there is no other way and avoiding the evaluation of the empty path unless there is no other way) should either be handled by the same change or the latter before the former.

RobinTF · 2025-02-12T21:04:35Z

@hannahbast Thank you for clarifying. I wanted to have a look at implementing negated paths anyways, so I spent some time today thinking about query planning of this particular case. So I might just start with the issue at hand first.
A small thought: If sort supported on-disk laziness (I talked with @joka921 briefly about this), then it would still take ages, but it would eventually finish

RobinTF · 2025-02-14T18:36:21Z

This needs some other stuff to happen beforehand

RobinTF added 2 commits February 12, 2025 15:07

Support empty transitive path

2406caf

Add Unit Tests

f21e561

joka921 requested changes Feb 12, 2025

View reviewed changes

RobinTF mentioned this pull request Feb 13, 2025

Minor refactorings in the query planning for property paths #1805

Merged

RobinTF closed this Feb 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow computing the empty transitive path #1800

Allow computing the empty transitive path #1800

RobinTF commented Feb 12, 2025

sparql-conformance bot commented Feb 12, 2025

codecov bot commented Feb 12, 2025 •

edited

Loading

sonarqubecloud bot commented Feb 12, 2025

hannahbast commented Feb 12, 2025

RobinTF commented Feb 12, 2025

joka921 left a comment

joka921 Feb 12, 2025

hannahbast commented Feb 12, 2025 •

edited

Loading

RobinTF commented Feb 12, 2025

hannahbast commented Feb 12, 2025

RobinTF commented Feb 12, 2025

RobinTF commented Feb 14, 2025

Allow computing the empty transitive path #1800

Allow computing the empty transitive path #1800

Conversation

RobinTF commented Feb 12, 2025

sparql-conformance bot commented Feb 12, 2025

Conformance check passed ✅

Test Status Changes 📊

codecov bot commented Feb 12, 2025 • edited Loading

Codecov Report

sonarqubecloud bot commented Feb 12, 2025

Quality Gate passed

hannahbast commented Feb 12, 2025

RobinTF commented Feb 12, 2025

joka921 left a comment

Choose a reason for hiding this comment

joka921 Feb 12, 2025

Choose a reason for hiding this comment

hannahbast commented Feb 12, 2025 • edited Loading

RobinTF commented Feb 12, 2025

hannahbast commented Feb 12, 2025

RobinTF commented Feb 12, 2025

RobinTF commented Feb 14, 2025

codecov bot commented Feb 12, 2025 •

edited

Loading

hannahbast commented Feb 12, 2025 •

edited

Loading