Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Autocomplete/search results need to be tuned to be maximally intuitive #132

Open
kltm opened this issue Jul 11, 2014 · 7 comments
Open

Autocomplete/search results need to be tuned to be maximally intuitive #132

kltm opened this issue Jul 11, 2014 · 7 comments

Comments

@kltm
Copy link
Member

kltm commented Jul 11, 2014

This covers a lot or ground, but the idea is that autocomplete and search results should be very intuitive for the majority of users.

The bulk of the solution may be connected to #131, and the simplified constructs around it. The rest will likely be tuning the new fields/parameters to get the desired "intuitive" results.

@kltm
Copy link
Member Author

kltm commented Jul 11, 2014

Once the engineering bits of #131 are handled, we'll need a consistent target list (all desired outcomes are possible) to work with and try to achieve it with tuning.

@kltm
Copy link
Member Author

kltm commented Oct 23, 2014

Val's cdc2 vs. cdc20 is another good example.

@cmungall
Copy link
Member

Here's the main issue. There are dozens of genes called foo. The labels is unique (more or less...) within a species, but shared across species.

A user typing 'foo' sees all these in a random order. Sometimes you can guess what the species is by whether it's foo Foo or FOO. But this is rubbish. This will get worse as we add more organisms. The organism most people want (e.g. human) may be buried beyond the available list.

  1. Add species in parentheses. Can be done upstream in Golr loading (I think my preference) or display/AC logic. Also with the upstream method I think savvy users can streamline their search (e.g. "foo mus")
  2. There should be an ordering that prioritizes humans above e.g. armadillos. Not sure how best to do this. We could add a 'relevancy' field to the schema. Golr loading would take care of filling this in. This would be generally useful. E.g. in term AC GO would be more relevant than others.

@kltm
Copy link
Member Author

kltm commented Oct 23, 2014

1 is essentially #24.
2 will require some fairly low-level goosing, likely a form of #131 and #132.

@kltm
Copy link
Member Author

kltm commented Nov 3, 2014

This is fundamentally related to the search reworking in berkeleybop/bbop-js#16.

@kltm
Copy link
Member Author

kltm commented Nov 10, 2014

More good examples in:

http://jira.geneontology.org/browse/GO-633

@cmungall
Copy link
Member

Should we split this ticket to get separate specs for bioentities vs classes? There are similarities and differences. For bioentities the species issue is paramount (see comments above, the Noctua AC is a massive improvement).

The class requirements have not been stated, first pass:

  • obsolete classes should be ranked lower
  • synonyms should probably be ranked higher than they are, this is partly a nudge to accommodate the fact that GO is overly formal in its nomenclature at times. E.g. try getting "apoptosis" in AC - you can't. (hint: GO calls this "apoptotic process" and uses "apoptosis" as a synonym, contrary to normal biologist usage)
  • the primary ontology (GO for GO) should rank above others. Again, try "apoptosis" - none of the top ACs are in GO

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants