LLMJoin

Usage

`LLMJoin`

Bases: JoinIngredient

`from_args(model=None, use_skrub_joiner=True, few_shot_examples=None, num_few_shot_examples=None, enable_constrained_decoding=True)` `classmethod`

Creates a partial class with predefined arguments.

Parameters:

Name	Type	Description	Default
`few_shot_examples`	`Optional[Union[List[dict], List[AnnotatedJoinExample]]]`	A list of few-shot examples, given either as dictionaries or as AnnotatedJoinExample objects. If not specified, the examples in default_examples.json are used.	`None`
`use_skrub_joiner`	`bool`	Whether to use the skrub joiner. Defaults to True.	`True`
`num_few_shot_examples`	`Optional[int]`	The number of few-shot examples to use for each ingredient call. If None (the default), all few-shot examples are used on every call. If specified, a haystack-based DPR retriever is initialized to filter the examples.	`None`

Returns:

Type	Description
`Type[JoinIngredient]`	A partial class of JoinIngredient with predefined arguments.
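
The "partial class" idea can be illustrated with a minimal, generic sketch. This is not BlendSQL's implementation: the `Ingredient` base class and the `preset_kwargs` attribute below are hypothetical names used only to show how a classmethod can return a class with some arguments already filled in.

```python
from typing import Any, Dict, Type


class Ingredient:
    """Hypothetical stand-in base class; not BlendSQL's JoinIngredient."""

    # Arguments fixed in advance; empty on the base class.
    preset_kwargs: Dict[str, Any] = {}

    def __init__(self, **kwargs: Any) -> None:
        # Arguments passed at construction time override the presets.
        self.kwargs = {**self.preset_kwargs, **kwargs}

    @classmethod
    def from_args(cls, **preset: Any) -> Type["Ingredient"]:
        # Return a new subclass that carries the presets as a class attribute,
        # leaving the original class untouched.
        merged = {**cls.preset_kwargs, **preset}
        return type(cls.__name__, (cls,), {"preset_kwargs": merged})


# The returned object is a class, not an instance.
Configured = Ingredient.from_args(num_few_shot_examples=2)
instance = Configured(use_skrub_joiner=False)
```

Because `from_args` returns a class rather than an instance, the caller can pass it around and instantiate it later, which is how the ingredient is handed to `BlendSQL` in the example that follows.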

Examples:

from blendsql import BlendSQL
from blendsql.ingredients.builtin import LLMJoin, DEFAULT_JOIN_FEW_SHOT

ingredients = {
    LLMJoin.from_args(
        few_shot_examples=[
            *DEFAULT_JOIN_FEW_SHOT,
            {
                "join_criteria": "Join the state to its capital.",
                "left_values": ["California", "Massachusetts", "North Carolina"],
                "right_values": ["Sacramento", "Boston", "Chicago"],
                "mapping": {
                    "California": "Sacramento",
                    "Massachusetts": "Boston",
                    "North Carolina": "-"
                }
            }
        ],
        num_few_shot_examples=2
    )
}

bsql = BlendSQL(db, ingredients=ingredients)

Description

This ingredient handles the logic of semantic JOIN clauses between tables.

In other words, it creates a custom mapping between a pair of value sets. Behind the scenes, this mapping is then used to create an auxiliary table to use in carrying out an INNER JOIN.

For example:

SELECT Capitals.name, States.name FROM Capitals
    JOIN {{
        LLMJoin(
            'Capitals::name',
            'States::name',
            question='Align state to capital.',
        )
    }}

The above example hints at a database schema that would make E. F. Codd very angry: why do we have two separate tables, States and Capitals, with no foreign key to join the two?

BlendSQL was built to interact with tables "in-the-wild", and many (such as those on Wikipedia) do not have these convenient properties of well-designed relational models.

For this reason, we can leverage the internal knowledge of a pre-trained LLM to do the JOIN operation for us.
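
To make the auxiliary-table step concrete, here is a minimal sketch using Python's built-in `sqlite3` module. It is not BlendSQL's internal code: the mapping is hard-coded in place of a model's output, the table name `aux_mapping` and its column names are assumptions, and `"-"` is assumed to mean "no match", following the few-shot example above.

```python
import sqlite3

# Hypothetical mapping, standing in for what the LLM would return.
mapping = {
    "California": "Sacramento",
    "Massachusetts": "Boston",
    "North Carolina": "-",  # assumed to mean "no match"
}

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE States (name TEXT);
    CREATE TABLE Capitals (name TEXT);
    INSERT INTO States VALUES ('California'), ('Massachusetts'), ('North Carolina');
    INSERT INTO Capitals VALUES ('Sacramento'), ('Boston'), ('Chicago');
    -- Auxiliary table holding the aligned value pairs (name is an assumption).
    CREATE TABLE aux_mapping (left_value TEXT, right_value TEXT);
    """
)

# Store only the pairs for which a match was found.
conn.executemany(
    "INSERT INTO aux_mapping VALUES (?, ?)",
    [(left, right) for left, right in mapping.items() if right != "-"],
)

# An ordinary INNER JOIN through the auxiliary table links the two tables.
rows = conn.execute(
    """
    SELECT Capitals.name, States.name
    FROM States
    INNER JOIN aux_mapping ON States.name = aux_mapping.left_value
    INNER JOIN Capitals ON Capitals.name = aux_mapping.right_value
    ORDER BY States.name
    """
).fetchall()

for capital, state in rows:
    print(f"{state} -> {capital}")
```

Since the join is an INNER JOIN, rows without a mapped partner (here, North Carolina and Chicago) drop out of the result.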
