Embedding
Embedding or vectorization is the process of converting content into sets of numbers that, when compared mathematically with each other, quantify how similar they are in meaning. Embeddings are used to power semantic search engines, text classification, and cluster analysis as well as the retrieval component of retrieval-augmented generation (RAG) applications. Isaacus’ flagship embedding model is Kanon 2 Embedder, which currently ranks as the most accurate legal AI embedder on the Massive Legal Embedding Benchmark (MLEB).Model | Architecture | Dimensions | Context window | Description |
---|---|---|---|---|
kanon-2-embedder | Kanon 2 | 1,792 (default), 1,024, 768, 512, 256 | 16,384 | The most accurate legal AI embedder on the Massive Legal Embedding Benchmark (MLEB). |
Extractive question answering
Extractive question answering or answer extraction is the process of extracting answers to questions from documents. Extractive QA can be used to solve a broad range of information extraction problems from pulling out key dates from a contract to highlighting legal doctrines relied upon in a judgment. We offer two extractive QA models, tailor-made for common legal information extraction use cases: Kanon Answer Extractor and Kanon Answer Extractor Mini.Model | Architecture | Context window | Description |
---|---|---|---|
kanon-answer-extractor | Kanon | 512 | Our base answer extractor, designed to balance precision with throughput. |
kanon-answer-extractor-mini | Kanon | 512 | A lighter version of the Kanon Answer Extractor, designed to maximize throughput and cost-efficiency. |
Universal classification
Universal classification (also known as zero-shot classification) refers to the process of determining whether a given statement about a document, for example, “this is a confidentiality clause”, is supported by that document. Unlike traditional classifiers, universal classifiers don’t require training data to classify a document. In practice, universal classifiers can be used for a broad range of information retrieval and extraction tasks, from pulling out particular types of clauses from contracts all the way to scoring the relevance of legal documents to search queries. Isaacus currently offers two universal classifiers: Kanon Universal Classifier and Kanon Universal Classifier Mini.Model | Architecture | Context window | Description |
---|---|---|---|
kanon-universal-classifier | Kanon | 512 | Isaacus’ powerful flagship universal classification model. |
kanon-universal-classifier-mini | Kanon | 512 | A more lightweight variant of the Kanon Universal Classifier, optimized for speed and cost-efficiency. |