Before submitting your bug report
Relevant environment info
- OS:Ubuntu 24.04
- Continue version:1.2.22
- IDE version:VSCode 1.118.1
- Model:qwen3-coder-30b-a3b-instruct
- config:
name: Local Config
version: 0.0.1
schema: v1
models:
- name: code autocomplete
provider: openai
model: qwen3-coder-30b-a3b-instruct
apiBase: http://localhost:3000/v1
useResponsesApi: false
roles:
- autocomplete
- edit
autocompleteOptions:
disable: false
Description
I am trying to get continue running with a LLM that is hosted on a server that does not use a standard API. For this I wrote a proxy in python to translate the openai API to the one my server uses. For the chat I got that working without problems, but autocomplete does not register a response as seen in the following screenshot.
For the proxy server I am using the following python code (with a separate function for the chat). I have also tried using a streamed response with the same result.
from fastapi import FastAPI
from fastapi.responses import JSONResponse
app = FastAPI()
@app.post("/v1/completions")
async def completions(req):
# communication with LLM server generates variable "text"
...
response = {
"id": "cmpl-" + str(int(time.time())),
"object": "text_completion",
"created": int(time.time()),
"model": req.model,
"choices": [
{
"index": 0,
"message":{
"role": "assistant",
"content": text
},
"finish_reason": "stop"
}
],
#"usage": {
# "completion_tokens": 16,
# "prompt_tokens": 10,
# "total_tokens": 26
#}
}
return JSONResponse(
content=response
)
The answers from the server typically have the format:
''' python
some completion
'''
I have also tried hardcoding the response as e.g.:
To reproduce
No response
Log output
Before submitting your bug report
Relevant environment info
Description
I am trying to get continue running with a LLM that is hosted on a server that does not use a standard API. For this I wrote a proxy in python to translate the openai API to the one my server uses. For the chat I got that working without problems, but autocomplete does not register a response as seen in the following screenshot.
For the proxy server I am using the following python code (with a separate function for the chat). I have also tried using a streamed response with the same result.
The answers from the server typically have the format:
I have also tried hardcoding the response as e.g.:
To reproduce
No response
Log output