Elasticsearch update script using python bulk update [closed]

Ask Question

Asked 5 years, 2 months ago

Modified 5 years, 2 months ago

Viewed 2k times

Closed. This question is off-topic. It is not currently accepting answers.

Missing Review Context: Code Review requires concrete code from a project, with enough code and / or context for reviewers to understand how that code is used. Pseudocode, stub code, hypothetical code, obfuscated code, and generic best practices are outside the scope of this site.

Closed 5 years ago.

Improve this question

I'm using this script to bulk update docs in my index.

I need to update a field of a doc in Elasticsearch and add the count of that doc in a list inside python code. The weight field contains the count of the doc in a dataset. The dataset needs to be updated from time to time.So the count of each document must be updated too. hashed_ids is a list of document ids that are in the new batch of data. the weight of matched id must be increased by the count of that id in hashed_ids.

For example let say a doc with id=d1b145716ce1b04ea53d1ede9875e05a and weight=5 is already present in index. and also the string d1b145716ce1b04ea53d1ede9875e05a is repeated three times in the hashed_ids so the update_with_query query will match the doc in database. I need to add 3 to 5 and have 8 as final weight.

The code below works for it but it is too slow and from time to time I get time out error.

hashed_ids = [hashlib.md5(doc.encode('utf-8')).hexdigest() for doc in shingles]
update_by_query_body =
{
  "query":{
    "terms": {
      "id":["id1","id2"]
    }
  },
  "script":{
    "source":"long weightToAdd = params.hashed_ids.stream().filter(idFromList -> ctx._source.id.equals(idFromList)).count(); ctx._source.weight += weightToAdd;",
    "params":{
      "hashed_ids":["id1","id1","id1","id2"]
    }
  }
}

edited Oct 10, 2020 at 18:36

hjpotter92

8,9411 gold badge26 silver badges50 bronze badges

asked Oct 10, 2020 at 13:16

Marzi Heidari

1698 bronze badges

\$\begingroup\$ Where is the rest of your code? Please include it. \$\endgroup\$

Mast
– Mast

2020-10-16 15:44:35 +00:00
Commented Oct 16, 2020 at 15:44

Add a comment |

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Stack Exchange Network

Elasticsearch update script using python bulk update [closed]

0

Hot Network Questions

Elasticsearch update script using python bulk update [closed]

0

Related

Hot Network Questions