-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retry Policy for Jmx Connection #700
Labels
enhancement
New feature or request
PoC/Agent
Tasks related to new generation of ecchronos as an agent
Comments
VictorCavichioli
added
enhancement
New feature or request
PoC/Agent
Tasks related to new generation of ecchronos as an agent
labels
Aug 21, 2024
2 tasks
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 3, 2024
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 3, 2024
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 4, 2024
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 5, 2024
- Removed unused imports - Increased checkstyle line length - Fixed PMD warning
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 5, 2024
- Retry Policy for jmx connection - Read all nodes statuses from nodes_sync table - Filter out all node with unavailable status - Retry to attempt jmx connection for unavailable nodes - If connection successful add connection and change status to available in table. - If connection failed after given retry attempts, change status to unreachable in table. - This is schedule service that would execute after given fix delay. By default it would run after every 24 hour and scan the table.
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 5, 2024
- Retry Policy for jmx connection - Read all nodes statuses from nodes_sync table - Filter out all node with unavailable status - Retry to attempt jmx connection for unavailable nodes - If connection successful add connection and change status to available in table. - If connection failed after given retry attempts, change status to unreachable in table. - This is schedule service that would execute after given fix delay. By default it would run after every 24 hour and scan the table.
SajidRiaz138
pushed a commit
to SajidRiaz138/ecchronos
that referenced
this issue
Sep 10, 2024
- Retry Policy for jmx connection - Read all nodes statuses from nodes_sync table - Filter out all node with unavailable status - Retry to attempt jmx connection for unavailable nodes - If connection successful add connection and change status to available in table. - If connection failed after given retry attempts, change status to unreachable in table. - This is schedule service that would execute after given fix delay. By default it would run after every 24 hour and scan the table.
tommystendahl
pushed a commit
that referenced
this issue
Sep 10, 2024
- Retry Policy for jmx connection - Read all nodes statuses from nodes_sync table - Filter out all node with unavailable status - Retry to attempt jmx connection for unavailable nodes - If connection successful add connection and change status to available in table. - If connection failed after given retry attempts, change status to unreachable in table. - This is schedule service that would execute after given fix delay. By default it would run after every 24 hour and scan the table. Co-authored-by: sajid riaz <[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
enhancement
New feature or request
PoC/Agent
Tasks related to new generation of ecchronos as an agent
Story Description:
We create the Jmx connections on the startup, if we success establish a connection with a node, we include this connection in the list and we mark this node as AVAILABLE on nodes_sync, else, we mark it as UNAVAILABLE, the main goal would be to define a retry policy for each UNAVAILABLE node, after the max retries, the node should be marked as UNREACHABLE.
Acceptance Criteria:
NA
Definition of Done:
Code working as described, test cases done, approved by maintainers.
Notes:
I did a patch for the a RetryPolicy on the sidecar application startup that could be used to do it, check #656
Related to #652
The text was updated successfully, but these errors were encountered: