chunks
sample_word(self, context, max_tokens=5, allow_punctuation=True)
async
Sample a word from the LMContext
object context
.
Source code in hfppl/chunks.py
sample_word_2(self, context, max_chars=None, allow_mid_punctuation=True, allow_end_punctuation=True)
async
Sample a word from the LMContext
object context
.
Unlike sample_word() above, this method allows for character-level control over the length of the word. It also allows for control over the presence of punctuation in the middle and at the end of the word.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
max_chars |
int
|
Maximum number of characters in the word. If None, the model will sample a word of any length. |
None
|
allow_mid_punctuation |
bool
|
If True, the model may sample punctuation in the middle of the word. |
True
|
allow_end_punctuation |
bool
|
If True, the model may sample punctuation at the end of the word. |
True
|
Returns:
Type | Description |
---|---|
Tuple[str, str]: The sampled word and punctuation |