embedding nv2
parent
6a9696a6a7
commit
87fdae9f21
@ -0,0 +1,39 @@
|
||||
# NV2 - Embedding persistence and job orchestration
|
||||
|
||||
This patch continues the new parallel `at.procon.dip.embedding.*` subsystem introduced in NV1.
|
||||
|
||||
## Scope
|
||||
|
||||
NV2 adds:
|
||||
|
||||
- representation-driven selection policy
|
||||
- `DOC.doc_embedding_job` queue table
|
||||
- job lifecycle service with retry scheduling
|
||||
- model-catalog sync into `DOC.doc_embedding_model`
|
||||
- persistence of vectors into `DOC.doc_embedding`
|
||||
- orchestrator for enqueueing and processing jobs
|
||||
- unit tests for the new orchestration layer
|
||||
|
||||
## Still intentionally missing
|
||||
|
||||
- no cutover of the old vectorization route
|
||||
- no scheduler / background polling by default
|
||||
- no semantic search engine yet
|
||||
- no migration / backfill yet
|
||||
|
||||
## Intended usage
|
||||
|
||||
New code can now do:
|
||||
|
||||
1. enqueue a document or representation for embedding with a configured model key
|
||||
2. process the pending jobs through the new provider-based subsystem
|
||||
3. store the resulting vectors in the generic DOC embedding tables
|
||||
|
||||
## Next step after NV2
|
||||
|
||||
NV3 should add:
|
||||
|
||||
- `PgVectorSemanticSearchEngine`
|
||||
- semantic repository
|
||||
- query embedding integration into the generic search engine
|
||||
- hybrid lexical + semantic fusion
|
||||
Loading…
Reference in New Issue