-
Notifications
You must be signed in to change notification settings - Fork 879
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Request Content-Length fix for Apache HttpClients #6749
base: main
Are you sure you want to change the base?
Request Content-Length fix for Apache HttpClients #6749
Conversation
@trask @mateuszrzeszutek the change is ready and I have been able to cover all possible scenarios to be able to provide these features till the oldest version of http client 4.x 😌 I would be grateful if you could find some time from your busy schedules to review these changes. Thanks. |
@mateuszrzeszutek if you get time can you also start review of this PR? |
@trask this seems like a big change, but can you find some time to look into this if possible. |
…d when instrumentation is suppressed
d64d59d
to
7760022
Compare
b0cfe98
to
a106ca4
Compare
@mateuszrzeszutek @trask can you please look into this MR, I had completed the changes almost a month ago and had to rebase the changes all over again because of few changes being pushed to master recently. Since this is a big change, rebasing becomes very difficult and eats up a lot of time. |
@trask any ETA when this can be looked into? |
hi @anuragagarwal561994, sorry for the long delay. do you think you could find some ways to split this out into a few smaller PRs that would make it easier to review? thx! |
Hello @trask thanks for taking time to respond back. This MR involves a lot of small improvements. I will try my best to break it down in smaller pieces but I will be needing constant advice on the same. Major change here have been because common code was extracted from all the modules and put in a common place to avoid repetitions |
maybe a first PR that just moves things around (in preparation for next PR(s))? |
So I was thinking that I can rebase merge few commits here and there first. Each commit representing the problem that is solved as stated in the description. There you can either review commit by commit or I can break those commits as separate merge requests. |
I have changed my approach as rebasing was slowing me down and requiring more efforts. I will be submitting a list of small PRs with and will link the issue on this thread. |
@trask let's start to review the above 2 PRs and then we I will create more, because they are kind of dependent. If there are more review comments, I will have to update all of them. |
Closes #6747
Changes Summary
Implementation Summary
BytesTransferMetrics: This is used to record the actual bytes transferred when a request is sent or the response is received. The recorded metrics is used in case when the request is sent / received using chunked encoding, because in that case the
content-length
header is not present and we can't determine the content length without consuming the whole content. It fills up the content length attributes when the headers are not present.CountingOutputStream: This is used to calculate the bytes written by the request when the entity is chunked. We keep a counter to add the buffer sizes. Implementation has picked up from
Guava
&Apache Libraries
.ApacheHttpClientProcessorInstrumentation
The internal request is actually present in HttpContext and the first approach was to fetch from the same, but there were couple of problems doing so.
To resolve the above limitation a different approach was incorporated. We instrumented the
HttpProcessor
which will work for both the http classic client / server and it will receive the request / response as parameters. We will store them with respect to otel context in a virtual field and use the value at the time of ending the instrumentation.Note that here the context of the http classic client will be same as the context of the request, hence we can directly use currentContext.
For async client, the context will change but in async client we can ensure we always have a HttpContext or we create one if not provided by the user. Hence we add otel context in the http context and use it when this http context is passed by the client to the processor methods. We keep updating the latest request / response received because in case of retries or authentication logics, we might want to keep the latest.
Side Note: HttpProcessor can also run for Apache Http Server provided by the library and also for the suppressed spans. Hence we check for HTTP_CLIENT SpanKey to be part of the current context and only then we will execute the logic.
HttpOtelContext: This is just an adapter to make our life easier to use http context in a unified way, to store, fetch and remove otel related information.
ApacheHttpClientEntityStorage: To deal with storage of and within http client entities.
ApacheHttpClientAttributesHelper: Contains all attribute related helpers
ApacheHttpClientInstrumentationHelper: Contains instrumentation related helpers
Other instrumentation changes
opensearch-rest-client
: addedREQEUST_CONTENT_LENGTH
attribute as 0 for GET requestselasticsearch-rest-client
: addedREQEUST_CONTENT_LENGTH
attribute as 0 for GET requeststwilio
: here although we should addREQEUST_CONTENT_LENGTH
&RESPONSE_CONTENT_LENGTH
in the tests, but the tests are using mocked clients instead of actual once, hence they don't follow the entire process of instrumentation.I tried implementing non mocked client, but since the urls for twilio are hardcoded, I would have had to handle that case as well. Moreover it would have required making adding a dummy server in the test cases as well. Left that implementation, because anyways the instrumentations guarantees might not be applicable on mocked entities as they are not following the whole process.