You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
40 lines
804 B
Markdown
40 lines
804 B
Markdown
# Vector-sync HTTP embedding provider
|
|
|
|
This patch adds a new provider type:
|
|
|
|
- `http-vector-sync`
|
|
|
|
## Request
|
|
Endpoint:
|
|
- `POST {baseUrl}/vector-sync`
|
|
|
|
Request body:
|
|
```json
|
|
{
|
|
"model": "intfloat/multilingual-e5-large",
|
|
"text": "This is a sample text to vectorize"
|
|
}
|
|
```
|
|
|
|
## Response
|
|
```json
|
|
{
|
|
"runtime_ms": 472.49,
|
|
"vector": [0.1, 0.2, 0.3],
|
|
"incomplete": false,
|
|
"combined_vector": null,
|
|
"token_count": 9,
|
|
"model": "intfloat/multilingual-e5-large",
|
|
"max_seq_length": 512
|
|
}
|
|
```
|
|
|
|
## Notes
|
|
- supports a single text per request
|
|
- works for both document and query embeddings
|
|
- validates returned vector dimension against the configured embedding model
|
|
- keeps the existing `/embed` provider in place as `http-json`
|
|
|
|
## Example config
|
|
See `application-new-example-vector-sync-provider.yml`.
|