
The endpoint implementation looks like this:

from fastapi import Depends, HTTPException, Path

@app.post("/api/chat/{question}", dependencies=[Depends(sessionValidator)])
async def chat(question: str = Path(...), my_chatbot=Depends(chatbotProvider)):
    # Generate response
    try:
        logger.info("Getting LLM response...")

        ans = my_chatbot.chatbot.chat(question)

        logger.info(f"SUCCESS! Chatbot responded: {ans.response}")

        return ans.response

    except Exception as e:
        logger.error(f"FAILED! Got no response from chatbot! Err {e}")
        raise HTTPException(status_code=500, detail="Chatbot gave no response!")
    

The chatbot (whose chat method I call above) is a llama-index chat engine built on a vector index. It is an instance of BaseChatEngine from llama_index.core.chat_engine.types and uses an Ollama Llama3 model from llama_index.llms.ollama.

Here is the code that creates the chatbot in my wrapper class MyChatAssistant, which holds the llama-index BaseChatEngine instance as the attribute chatbot:

self.chatbot = self.index.as_chat_engine(
    chat_mode=self._chat_mode,
    verbose=True,
    similarity_top_k=6,
    filters=MetadataFilters(filters=[
        ExactMatchFilter(key="transcript_id", value=self.transcript_id),
    ]),
)

Interestingly, I have extensively tested the chatbot (minus the REST API endpoint wrapper) in a Jupyter notebook. There I used:

import nest_asyncio
nest_asyncio.apply()

This took care of all issues. The same did not work in FastAPI; it seems the event loop that FastAPI uses is of a different type.
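To illustrate what I mean, here is a minimal stdlib-only sketch (no FastAPI, no llama-index; blocking_chat is a hypothetical stand-in for the synchronous chat() call) showing that re-entering an already-running loop raises the error I see, while offloading the blocking call to a worker thread does not:

```python
import asyncio

def blocking_chat(question: str) -> str:
    # Hypothetical stand-in for the synchronous chatbot.chat() call.
    return f"answer to {question}"

async def handler() -> tuple[str, str]:
    loop = asyncio.get_running_loop()
    # Re-entering the running loop fails, exactly as in the FastAPI handler:
    coro = asyncio.sleep(0)
    try:
        loop.run_until_complete(coro)
        err = "no error"
    except RuntimeError as e:
        err = str(e)
    finally:
        coro.close()  # the coroutine was never scheduled; close it cleanly
    # Offloading the blocking call to a worker thread works:
    ans = await asyncio.to_thread(blocking_chat, "hi")
    return ans, err

result, err = asyncio.run(handler())
```

Note that nest_asyncio papers over the first case by patching the loop to allow re-entry, which is why the notebook version worked.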

I have tried the following:

4.

ans = asyncio.get_event_loop().run_until_complete(my_chatbot.chatbot.chat(question))
# or
loop = asyncio.get_event_loop()
task = loop.create_task(my_chatbot.chatbot.chat(question))
ans = await task

Err: This event loop is already running

3.

ans = asyncio.to_thread(my_chatbot.chatbot.chat(question))
# or
ans = await asyncio.to_thread(my_chatbot.chatbot.chat(question))

Err: This event loop is already running

2.

loop = asyncio.get_event_loop()
ans = await loop.run_in_executor(None, my_chatbot.chatbot.chat, question)
ans = my_chatbot.chatbot.chat(question)
# or
ans = await my_chatbot.chatbot.chat(question)
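For comparison, here is how the run_in_executor variant behaves in plain asyncio, stripped of FastAPI and llama-index (blocking_call is a hypothetical stand-in for chatbot.chat):

```python
import asyncio

def blocking_call(question: str) -> str:
    # Hypothetical stand-in for the synchronous chatbot.chat(question).
    return question.upper()

async def main() -> str:
    loop = asyncio.get_running_loop()
    # The callable and its argument are passed separately --
    # blocking_call is NOT invoked here, only referenced.
    return await loop.run_in_executor(None, blocking_call, "hello")

result = asyncio.run(main())
```

One subtlety: writing run_in_executor(None, f(x)) or to_thread(f(x)) calls f(x) immediately on the loop thread before the executor ever sees it, which would reproduce the "already running" error.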

Grateful for any help!


1 Answer


I guess my_chatbot.chatbot.chat() is a sync wrapper that runs an async function in the event loop, so you need to use the async alternative of this method.

Try changing

        ans = my_chatbot.chatbot.chat(question)

to

        ans = await my_chatbot.chatbot.achat(question)

1 Comment

Thank you for your suggestion; it makes sense. However, this change caused some unexpected side effects. Specifically, it makes the bot retrieve the raw vector embeddings instead of text as context. I made no other change that could cause this. I will investigate and circle back.
