text¶
Manipulation of textual data.
Textual data (pre)processing¶
|
Remove punctuation and optionally whitespace from textual data. |
|
Generate an acronym (in capital letters) from textual data. |
|
Extract words from a string by splitting it at occurrences of uppercase letters. |
Convert a number written in English words into its equivalent numerical value represented in Arabic numerals. |
|
|
Count the occurrences of each word in the given text. |
|
Calculate Inverse Document Frequency (IDF) for a sequence of textual documents. |
|
Calculate TF-IDF (Term Frequency-Inverse Document Frequency) for the given textual documents. |
Textual data analysis¶
|
Compute the Euclidean distance between two sentences. |
|
Calculate the cosine similarity between two sentences. |
|
Find all strings in a sequence that match a given string. |
|
Find |