33 zooma endp #1

Mil-m · 2023-07-23T23:31:16Z

https://app.zenhub.com/workspaces/cohort-atlas-642e179715832600114e7867/issues/gh/ebibiosamples/cohort-atlas/33

…he port / requirements.txt update)

theisuru

Thanks Milena, this mostly looks good. I have added a few comments. Let me know on the chat if you have any questions or comments.

theisuru · 2023-07-24T09:52:08Z

harmonise/match.py

+        pass
+
+    def get_field_dict(self, url):
+        z_cl = ZoomaClient()


It is always good to use self-explanatory names; very short names are sometimes acceptable for very short-lived variables. Here I would name this zooma_client rather than z_cl.

theisuru · 2023-07-24T09:55:23Z

harmonise/match.py

+                    self.field_dict['semanticTags'] = el['semanticTags']
+                    self.field_dict['confidence'] = el['confidence']
+                except Exception as e:
+                    print(e)


We should use a proper logging library here. The default Python logging module will do fine.
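A minimal sketch of what this could look like with the standard `logging` module. The helper name `extract_fields` is hypothetical and only stands in for the loop body in match.py; narrowing the `except` to `KeyError` is also an assumption about which failure is expected there.

```python
import logging

# One module-level logger, named after the module, is the usual convention.
logger = logging.getLogger(__name__)


def extract_fields(el):
    """Copy selected keys from one Zooma result element (hypothetical helper)."""
    field_dict = {}
    try:
        field_dict["semanticTags"] = el["semanticTags"]
        field_dict["confidence"] = el["confidence"]
    except KeyError:
        # logger.exception logs at ERROR level and appends the traceback,
        # which print(e) would lose.
        logger.exception("Unexpected Zooma element: %r", el)
    return field_dict
```

Log level and output format can then be set once at application start-up (for example with `logging.basicConfig`) instead of being hard-coded at each call site.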

theisuru · 2023-07-24T10:36:41Z

uploads/sample_labels_to_annotate.csv

@@ -1,12 +1,29 @@
-id,name,label,description,type,values,parent,annotations,tags


We are not only receiving the 'labels' but a CSV file that could also contain fields such as 'type', 'description', etc.
The original file shows the possible CSV format, so you can add more labels in that same format.

Are you suggesting that I add more 'columns' there?

theisuru · 2023-07-24T10:38:58Z

tests/test_endpoints.py

+        print(f"Request failed with status code: {response.status_code}; file path: {file_path}")
+
+    assert len(outp_json) > 0, "Empty json"
+    assert len(outp_json) == 29, "Wrong size json"


Test cases should be easy to understand. E.g., what does 29 mean here? Does it need a comment to explain it, or would a self-explanatory constant describe it better?

As I'm not going to change the CSV file for the test, I expect the result JSON to stay the same size. But if the service (f'http://www.ebi.ac.uk/spot/zooma/v2/api/services/annotate?propertyValue={label}') changes, the result will change too.
What is the best way to explain that here?
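One possible way to make the number self-explanatory is to derive it from the test input rather than hard-coding it. The sketch below assumes the endpoint returns one entry per non-empty label in the CSV; `SAMPLE_CSV`, `count_labels`, `EXPECTED_LABEL_COUNT`, and `check_output_size` are hypothetical names, and the inline CSV is only a stand-in for uploads/sample_labels_to_annotate.csv.

```python
import csv
import io

# Hypothetical stand-in for the real test CSV; the last row has an empty label.
SAMPLE_CSV = "id,name,label\n1,age,Age at present\n2,dob,Birthdate\n3,empty,\n"


def count_labels(csv_text):
    """Count the rows whose 'label' column is non-empty."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return sum(1 for row in reader if row["label"])


# The expected size now documents itself: one JSON entry per labelled row.
EXPECTED_LABEL_COUNT = count_labels(SAMPLE_CSV)


def check_output_size(outp_json):
    assert len(outp_json) == EXPECTED_LABEL_COUNT, (
        f"Expected one entry per label ({EXPECTED_LABEL_COUNT}), got {len(outp_json)}"
    )
```

If the output size really depends on what Zooma returns rather than on the input file, a named constant with a short comment stating that dependency may be the more honest option.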

theisuru · 2023-07-24T10:40:30Z

tests/test_endpoints.py

+
+    first_5_elements = dict(islice(outp_json.items(), 5))
+
+    expected_values = {


What happens if Zooma gains new knowledge and its output contains another mapping?

Possibly we can check only the JSON keys, like:
'Age at present'
'Age at the agreement date',
etc. ...
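A sketch of such a keys-only check, assuming the response is a dict keyed by label. The helper name `missing_labels` is hypothetical; the labels are the ones quoted in this thread. Comparing sets instead of the first five items also avoids depending on key order or on extra mappings that Zooma may add later.

```python
# Labels taken from the test data discussed in this review.
EXPECTED_LABELS = {
    "Age at present",
    "Age at the agreement date",
    "Alcohol consumption habits",
    "Birthdate",
}


def missing_labels(outp_json, expected=EXPECTED_LABELS):
    """Return the expected labels that are absent from the response keys."""
    return set(expected) - set(outp_json)
```

In the test this would reduce to `assert not missing_labels(outp_json)`, which still passes when the service returns additional keys.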

theisuru · 2023-07-24T10:46:46Z

harmonise/zooma.py

+import requests
+
+
+class ZoomaClient:


This class looks static. There are a few ways we can improve it to make it better OOP:

the URL could be an instance attribute accepted in the constructor (e.g. base_url)

get_json should have a better name and could accept arguments (e.g. field_label), then construct the final API call URL from base_url

Do you mean:

field_label_json = zooma_client.field_label()
if field_label_json is not None:
    for i, el in enumerate(field_label_json):

?
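For reference, the suggestion could be read roughly as follows. This is only a sketch under assumptions: the method names `annotate_url` and `annotate` and the `timeout` parameter are hypothetical, and the standard-library `urllib` is swapped in for `requests` purely to keep the example dependency-free.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


class ZoomaClient:
    """Thin client for the Zooma annotate service (illustrative sketch)."""

    def __init__(self, base_url, timeout=10):
        # base_url is passed in, so tests or other deployments can override it.
        self.base_url = base_url.rstrip("/")
        self.timeout = timeout

    def annotate_url(self, field_label):
        """Build the final API URL for one field label."""
        query = urlencode({"propertyValue": field_label})
        return f"{self.base_url}/services/annotate?{query}"

    def annotate(self, field_label):
        """Fetch and decode the annotations for one field label."""
        with urlopen(self.annotate_url(field_label), timeout=self.timeout) as response:
            return json.load(response)
```

The caller would then create the client once, e.g. `zooma_client = ZoomaClient("http://www.ebi.ac.uk/spot/zooma/v2/api")`, and call `zooma_client.annotate(label)` for each label.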

theisuru · 2023-07-24T10:48:57Z

harmonise/match.py

+
+        for label in labels:
+            if len(label) != 0:
+                fm_cl = FieldMatchingService()


Are FieldMatchingService and ZoomaClient doing two different things?
Also, the URL needs to be extracted into a variable or a config.
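One lightweight option for the config part is an environment variable with a default. In this sketch the variable name `ZOOMA_BASE_URL` and the helper `zooma_base_url` are hypothetical; the default value is the base of the service URL quoted elsewhere in this review.

```python
import os

# Default taken from the annotate URL used in this PR.
DEFAULT_ZOOMA_BASE_URL = "http://www.ebi.ac.uk/spot/zooma/v2/api"


def zooma_base_url(environ=os.environ):
    """Return the Zooma base URL, preferring the environment over the default."""
    return environ.get("ZOOMA_BASE_URL", DEFAULT_ZOOMA_BASE_URL)
```

Passing `environ` as a parameter keeps the function easy to test without touching the real process environment.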

Mil-m · 2023-07-25T01:35:35Z

tests/test_endpoints.py

+                       'Alcohol consumption habits', 'Birthdate']
+
+    for key, value in first_5_elements.items():
+        assert key in expected_values, f"Unexpected key in json: {key}"


Now I'm checking only the keys in the JSON.

mansurova added 12 commits May 15, 2023 11:51

getting fields from zooma

ee14fed

refactoring

1e23558

refactoring additional fixes

080e3b8

refactoring fixes (.env.txt file with port / .sh file for releasing the port / requirements.txt update)

f68e098

Dockerfile addition

21baa23

Docker-Flask fixes

3b6b0f0

adding shared directory

e4befb0

refactoring fix

68e7c96

README fix

b77b32d

using docker-compose.yml file

ce4e6cc

using POST in the 'match' endpoint

302e060

refactoring fixes

530c2bc

theisuru reviewed Jul 24, 2023


Mil-m force-pushed the 33-zooma_endp branch from 52b39cb to 7604c6c Compare July 25, 2023 01:11

additional refactoring fixes

7bc55b3

Mil-m force-pushed the 33-zooma_endp branch from 7604c6c to 7bc55b3 Compare July 25, 2023 01:32

Mil-m commented Jul 25, 2023
