Rosette

Column	Type	Required	Notes
Document ID	string	Yes	Unique per row; used to correlate outputs.
Text to process	string	Yes	UTF-8 text; HTML should be pre-cleaned if you don’t want tags analyzed.
(implicit) Language	string	No	Set through the Source text language parameter, which defaults to `AUTO`.
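
The note about pre-cleaning HTML can be sketched as a small upstream step. This is a minimal sketch using only the Python standard library; `strip_html` is a hypothetical helper name, not part of Rosette or of this component, and it assumes you want `script` and `style` content dropped along with the tags.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes and ignores tags, scripts, and styles."""

    def __init__(self):
        super().__init__(convert_charrefs=True)  # decodes entities such as &amp;
        self._chunks = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self._chunks.append(data)

    def text(self):
        # Join text nodes with a space, then collapse runs of whitespace.
        # Caveat: a word split by an inline tag ("wor<b>ld</b>") gains a space.
        return " ".join(" ".join(self._chunks).split())


def strip_html(raw: str) -> str:
    """Return the visible text of an HTML fragment (hypothetical helper)."""
    parser = _TextExtractor()
    parser.feed(raw)
    parser.close()  # flushes any buffered trailing text
    return parser.text()
```

Applying `strip_html` to the Text to process column before this step keeps markup out of the analysis; plain text passes through with only its whitespace normalized.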

Name	Type	Required	Default	Description
Document ID	Column selector	Yes	—	Column containing a unique identifier for the row/document.
Text to process	Column selector	Yes	—	Column containing the text to send to Rosette.
Operation	Enum	Yes	`sentiment analysis`	One of: `sentiment analysis`, `parts-of-speech (POS)`, `entity extraction`, `categorization`.
Source text language	Enum	No	`AUTO`	Language hint for the text. Options include `AUTO`, `Arabic (ara)`, `Chinese (zho)`, `Dutch (nld)`, `English (eng)`, `French (fra)`, `German (deu)`, `Hebrew (heb)`, `Indonesian (ind)`, `Italian (ita)`, `Japanese (jpn)`, `Korean (kor)`, `Pashto (pus)`, `Persian/Dari/Farsi (fas)`, `Portuguese (por)`, `Russian (rus)`, `Spanish (spa)`, `Urdu (urd)`. Sentiment analysis and categorization support English only.
API key	Secret string	Yes	—	Your Rosette API key. Do not commit to source control.
Output column name prefix	String	No	(blank)	An optional prefix added to all generated output columns (e.g., `ros_`).
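
The English-only restriction on sentiment analysis and categorization can be checked before a run. The following is a minimal sketch of such a pre-flight check; `check_settings` and its constants are hypothetical names for illustration and are not part of the component. Operation names are compared case-insensitively, and `AUTO` is accepted because the language is only known after detection.

```python
from typing import List

# Operation labels from the parameter table, lower-cased for comparison.
ENGLISH_ONLY_OPERATIONS = {"sentiment analysis", "categorization"}
OPERATIONS = ENGLISH_ONLY_OPERATIONS | {"parts-of-speech (pos)", "entity extraction"}


def check_settings(operation: str, language: str = "AUTO") -> List[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    op = operation.strip().lower()
    if op not in OPERATIONS:
        problems.append(f"unknown operation: {operation!r}")
    elif op in ENGLISH_ONLY_OPERATIONS and language not in ("AUTO", "English (eng)"):
        # AUTO passes here, but detection may still pick a non-English language.
        problems.append(
            f"{operation!r} supports English only; got language {language!r}"
        )
    return problems
```

For example, `check_settings("sentiment analysis", "French (fra)")` returns one problem, while `check_settings("entity extraction", "French (fra)")` returns an empty list.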

Symptom	Likely Cause	Fix
401/403 in log	Invalid/missing API key; wrong scope	Re-paste a valid key; confirm it’s not expired; avoid trailing spaces.
413 / “payload too large”	Row text exceeds 600 KB or 50,000 characters	Truncate or split long documents upstream.
Empty sentiment results	Non-English text or undetected language	Ensure English text; set Source text language = English (eng) explicitly.
422 / unsupported language for operation	Using Sentiment/Categorization on non-English	Restrict those ops to English; for other languages, use POS or Entity extraction.
Timeouts / connection errors	Network egress blocked; timeout too low	Allow outbound HTTPS; increase TCP/IP timeout (e.g., 15000 ms); set retries.
429 / rate limited	Throughput too high	Set Max throughput (e.g., 60 req/min); add backoff at the pipeline level.
No new columns	Output prefix does not match the column names the downstream schema expects	Remove the prefix temporarily to inspect the generated names; refresh the downstream writer’s schema.
Filter returned 0 rows	Process-all OFF with strict filter	Turn Process all rows ON or adjust filter expression/value.
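
Two of the fixes above, splitting long documents upstream and backing off after a 429, can be sketched in a few lines. This is an illustrative sketch, not the component's own behavior. It assumes that "600 KB" means 600 × 1024 bytes of UTF-8 and that chunks need their own unique Document ID. The `#<n>` ID suffix and the function names are hypothetical.

```python
MAX_CHARS = 50_000
MAX_BYTES = 600 * 1024  # assumption: 600 KB of UTF-8 encoded text


def split_text(text, max_chars=MAX_CHARS, max_bytes=MAX_BYTES):
    """Split text into chunks that respect both limits.

    A UTF-8 character takes at most 4 bytes, so capping the chunk length at
    max_bytes // 4 characters is a conservative way to honor the byte limit.
    """
    limit = min(max_chars, max_bytes // 4)
    if limit < 1:
        raise ValueError("limits are too small to hold any text")
    chunks, start = [], 0
    while start < len(text):
        end = min(start + limit, len(text))
        if end < len(text):
            # Prefer to cut just after the last space or newline in the window.
            cut = max(text.rfind(" ", start, end), text.rfind("\n", start, end))
            if cut > start:
                end = cut + 1
        chunks.append(text[start:end])
        start = end
    return chunks


def split_row(doc_id, text, **limits):
    """Return (document_id, chunk) pairs; short rows keep their original ID."""
    chunks = split_text(text, **limits)
    if len(chunks) <= 1:
        return [(doc_id, text)]
    return [(f"{doc_id}#{i}", chunk) for i, chunk in enumerate(chunks)]


def backoff_delays(retries=5, base=1.0, cap=60.0):
    """Exponential backoff schedule in seconds: base, 2*base, 4*base, capped."""
    return [min(cap, base * 2 ** attempt) for attempt in range(retries)]
```

Chunks concatenate back to the original text, so per-chunk results can be regrouped by the part of the ID before `#`. A pipeline-level retry loop would sleep for each value of `backoff_delays()` in turn before resubmitting a rate-limited request; adding random jitter to each delay is a common refinement.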

Description