Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OpenTracing being sunset for OpenTelemetry #2934

Open
tsloughter opened this issue Jun 9, 2020 · 14 comments
Open

OpenTracing being sunset for OpenTelemetry #2934

tsloughter opened this issue Jun 9, 2020 · 14 comments

Comments

@tsloughter
Copy link

Summary

I've been reaching out over the past few months to all maintainers of OpenTracing libraries in Erlang or Elixir. I learned about https://github.com/apache/couchdb/tree/prototype/fdb-layer/src/ctrace today and figured I'd do the same even though it is built into couchdb.

Desired Behaviour

Since OpenTracing is being sunset and OpenTelemetry (the merger of OpenTracing and OpenCensus) is its replacement it is likely desirable to users of instrumented libraries that they eventually move to OpenTelemetry, https://opentelemetry.io/

Possible Solution

There is an official OpenTelemetry Erlang API and SDK (https://github.com/open-telemetry/opentelemetry-erlang-api/ and https://github.com/open-telemetry/opentelemetry-erlang/), along with a SIG to discus implementation. Before today the SIG was only meeting once a month as part of the Erlang Ecosystem Foundation's Observability Working Group, but it will be being updated to meet weekly.

It is certainly not urgent or a must even if OpenTracing the spec is being deprecated, your tracing lib and reporting to Jaeger will continue to work for the foreseeable future -- and I see you have special couchdb specific functionality builtin, so not like you could simply drop in a new lib anyway. I just wanted to reach out like I have to all the others in hopes of collaboration on the future standard of open distributed tracing, metrics and logging in Erlang/Elixir.

@wohali
Copy link
Member

wohali commented Jun 9, 2020

@iilyak what do you think?

@iilyak
Copy link
Contributor

iilyak commented Jun 9, 2020

Few points which would probably make https://github.com/open-telemetry/opentelemetry-erlang-api unsuitable for us:

@iilyak
Copy link
Contributor

iilyak commented Jun 9, 2020

It looks like the author of the library we currently use abandoning it. It is really sad because it is wonderfully written with performance in mind.

@tsloughter
Copy link
Author

No, spans are not stored in a gen_servers state, they are stored in an ETS table while the span context is propagated within a process through the pdict.

Yes, persistent term is currently used without the ability to disable it. I don't know what couch's OTP back support is but there is always the option of modifying otel to use ETS when persistent_term isn't available, or just not moving to otel until the oldest OTP supported by couch is 21.3.

The opentelemetry_exporter lib does not require use of GRPC, though I also don't think grpcbox is an issue in this use case because of the lack of concurrency in the exporter. But it supports using plain http 1.1 as well. httpc is the only option at the moment simply because that was the quickest one to add support for. There is no plan to require httpc or any particular client, it is simply a matter of resources and priorities.

I'm sad to hear you dropped grpc and use of grpcbox because of this stream creation bug. I'm glad you linked to that issue on chatterbox (I maintain chatterbox as well) and will take a look at the issue soon, it must have slipped by me in email notifications and I hadn't manually looked at chatterbox's issues in a long time.

I wish I'd known couchdb was trying to use grpcbox/chatterbox. I would have made an effort to take care of these issues with you.

I believe grpcbox is mostly used for the server component by those using it in production. The client has been part that I've wanted to rewrite, and have started the rewrite recently. Though this does look like a chatterbox internal bug. Again an issue of resources and priorities.

@wohali
Copy link
Member

wohali commented Jun 9, 2020

@tsloughter We currently support 20.3 as our base, with "soft" support back to 19.latest. We're not planning on making 21.3 a minimum version for a while yet, so that's probably a hard stop for us for at least a year.

@iilyak
Copy link
Contributor

iilyak commented Aug 25, 2020

We can have our own persistent_term like module and patch 3 lines in opentelemetry. However there might be other incompatibilities to uncover.

@iilyak
Copy link
Contributor

iilyak commented Aug 25, 2020

The persistent_term is quite easy (about 150 LOC) to re-implement using merl.

@iilyak
Copy link
Contributor

iilyak commented Aug 25, 2020

The nice thing about OpenTelemetry is that it has some integration with seqtrace. Although I hope it is optional since we use sensitive flags on processes which handle http request. Which makes erlang tracing impossible. Currently it seems it relies on old erlang tracing API. Possibly due to impossibility of correlating call and replies in new tracing API as mentioned here census-instrumentation/opencensus-erlang#72 (comment). Just for the reference the rabbitmq/looking_glass NIF seem to have the same problem.

@iilyak
Copy link
Contributor

iilyak commented Aug 25, 2020

If we would find a way to use erlang tracing safely (without leaking sensitive data) we would be able to debug and trace system on demand even if we don't instrument the codebase using with_span.

@tsloughter
Copy link
Author

OpenTelemetry doesn't integrate with seqtrace. Originally I did include an implementation of ctx with seqtrace but removed it because I think how it functions is too confusing.

@tsloughter
Copy link
Author

Just saw @dch mention a new CouchDB release on irc and it reminded me to look back at this thread. I thought I'd just mention some development on the OpenTelemetry side, first being that the tracing spec 1.0 has been released, https://medium.com/opentelemetry/opentelemetry-specification-v1-0-0-tracing-edition-72dd08936978

The Erlang implementation has been stuck at an RC release for months but that will be changing soon with a 1.0 release in the coming weeks.

It has also been over a year since this was opened so maybe support for pre-21 Erlang is considered being dropped now?

Also all development is now done under the repo https://github.com/open-telemetry/opentelemetry-erlang/ (plus https://github.com/open-telemetry/opentelemetry-erlang-contrib where instrumentation for other libraries are kept) and the old opentelemetry-erlang-api repo is archived.

@nickva
Copy link
Contributor

nickva commented Sep 30, 2021

@tsloughter that's great to hear about OpenTelemetry, thanks for the update!

The new CouchDB 3.2 release still support Erlang and is already in the final release candidate phase. However, by the next major release it's highly likely we'd bump the minimum Erlang version to be >= 21, so OpenTelemetry would become easier to integrate.

@tsloughter
Copy link
Author

Ran into CouchDB metrics somewhere today and it made me think of this issue, so thought I'd pop an update here. I see in dev the required OTP version is now OTP 23+. Also, 1.0 of Erlang/Elixir Opentelemetry was released back in February https://medium.com/opentelemetry/opentelemetry-erlang-elixir-javascript-and-ruby-v1-0-3a0c32e0add4

@nickva
Copy link
Contributor

nickva commented Nov 10, 2022

@tsloughter thanks for stopping by and for the reminder! Yeah, with Erlang 23 as the minimum version it should make it easier to integrate.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants