Text to SQL RFT example #324

benjibc · 2025-11-10T06:57:51Z

Note

Extends Message.content to accept image content parts alongside text.

Written by Cursor Bugbot for commit 6026e78. This will update automatically on new commits.

cursor · 2025-11-10T06:59:12Z

text-to-sql-ep-rft/scripts/05_augment_sandbox.py

+    patterns = [
+        r"(?:FROM|JOIN)\\s+([a-zA-Z_][a-zA-Z0-9_]*)",
+        r'(?:FROM|JOIN)\\s+"([^"]+)"',
+        r"(?:FROM|JOIN)\\s+`([^`]+)`",


Bug: Regex Escaping Flaw Breaks SQL Parsing

The regex patterns are raw strings that use doubled backslashes (\\s, \\*). In a raw string, \\s is a literal backslash followed by the letter s, and \\* is a literal backslash repeated zero or more times, so neither matches SQL whitespace or the * of a block-comment delimiter. The patterns should use single backslashes (\s, \*) instead. As written, extract_tables fails to extract table names from SQL queries.
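A minimal sketch of the difference, using the first pattern from the diff above. The sample query and the use of re.IGNORECASE are assumptions made for illustration; how extract_tables actually applies its patterns is not shown in the excerpt.

```python
import re

# In a raw string, "\\s" is a literal backslash followed by "s";
# "\s" is the whitespace character class.
BROKEN = r"(?:FROM|JOIN)\\s+([a-zA-Z_][a-zA-Z0-9_]*)"
FIXED = r"(?:FROM|JOIN)\s+([a-zA-Z_][a-zA-Z0-9_]*)"

# Hypothetical query, used only for illustration.
SQL = "SELECT * FROM flights JOIN airports ON flights.origin = airports.code"

broken_tables = re.findall(BROKEN, SQL, flags=re.IGNORECASE)
fixed_tables = re.findall(FIXED, SQL, flags=re.IGNORECASE)

print(broken_tables)  # []
print(fixed_tables)   # ['flights', 'airports']
```

The broken pattern only matches text that contains an actual backslash after FROM or JOIN, which ordinary SQL never does.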

cursor · 2025-11-10T06:59:12Z

text-to-sql-ep-rft/scripts/01_simulate_prod_db.py

+                urllib.request.urlretrieve(url, path)
+                print(f"Downloaded: {path}")
+            # df = pd.read_csv(path, header=None, names=COLUMN_NAMES[name], na_values=["\\N"])
+            con.execute(f'CREATE OR REPLACE TABLE "{name}" AS SELECT * FROM df')


Bug: Undefined Variable Blocks Critical Process

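The code references df when creating tables, but df is never defined because the pd.read_csv line is commented out. Since df appears only inside the SQL string, the failure most likely surfaces as a DuckDB error about an unknown table df rather than a Python NameError. Either way the script stops before the database tables are created.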

cursor · 2025-11-10T06:59:12Z

text-to-sql-ep-rft/scripts/benchmark_models.py

+        return 0
+    ascii_table = ev["result"]["content"][0]["text"]
+    pred = parse_duckdb_ascii(ascii_table)
+    return 1 if are_equal(pred, ground_truth) else 0


Bug: Unhandled HTTP errors crash evaluation function.

The run_eval function doesn't handle exceptions from the HTTP request, so if requests.post raises an exception (network error, timeout, etc.), the function crashes instead of returning 0. The exception handling only covers the comparison logic, not the MCP server communication.
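One way to address this, sketched under assumptions: the request and the response parsing are wrapped together so that any transport or shape failure yields a score of 0. The names fetch_ascii_table, score, fetch, and compare are hypothetical; fetch stands in for whatever run_eval does to call the MCP server (for example requests.post(...).json()), and compare stands in for parsing the table and checking it against the ground truth.

```python
from typing import Any, Callable, Dict, Optional


def fetch_ascii_table(fetch: Callable[[], Dict[str, Any]]) -> Optional[str]:
    """Return the ASCII table text from the server response, or None on failure.

    `fetch` is a hypothetical zero-argument callable that performs the HTTP
    request and returns the decoded JSON body.
    """
    try:
        ev = fetch()
        return ev["result"]["content"][0]["text"]
    except (OSError, ValueError, KeyError, IndexError, TypeError):
        # OSError covers network errors and timeouts (requests.RequestException
        # derives from IOError); ValueError covers invalid JSON; the remaining
        # types cover a response that lacks the expected structure.
        return None


def score(fetch: Callable[[], Dict[str, Any]], compare: Callable[[str], bool]) -> int:
    """Return 1 if the fetched table passes `compare`, else 0."""
    table = fetch_ascii_table(fetch)
    if table is None:
        return 0
    return 1 if compare(table) else 0
```

Passing the request in as a callable keeps the sketch free of third-party imports; in the real script the try block would simply surround the existing requests.post call.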

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

chatgpt-codex-connector · 2025-11-10T06:59:25Z

text-to-sql-ep-rft/evaluator/sql_rft_evaluator.py

+    try:
+        gt_vals = sorted([sorted(map(norm, r.values())) for r in ground_truth])
+        pr_vals = sorted([sorted(map(norm, r.values())) for r in pred])
+        ok = gt_vals == pr_vals
+        return {"score": 1 if ok else 0, "reason": "match" if ok else f"mismatch: gt={ground_truth} pred={pred}"}


Compare SQL results without respecting column names

The evaluator normalizes both ground_truth and pred by sorting only the value lists (sorted(map(norm, r.values()))). This drops the association between column names and values, so a prediction that returns the right set of values but assigns them to the wrong columns (e.g., swapping origin and destination) will be marked as a perfect match. That produces false positives and hides invalid SQL generations. Consider normalizing on keyed tuples (e.g., sorting each row by column name and comparing dicts) so column/value alignment is preserved.
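A minimal sketch of the keyed comparison suggested above. The norm helper here is a hypothetical stand-in (the evaluator's real norm is not shown in the excerpt), and rows_match_values_only reproduces the current behaviour only for contrast.

```python
from typing import Any, Dict, List

Row = Dict[str, Any]


def norm(value: Any) -> str:
    # Hypothetical normalizer; the evaluator's actual norm is not shown.
    return "" if value is None else str(value).strip()


def rows_match_values_only(ground_truth: List[Row], pred: List[Row]) -> bool:
    """Current behaviour: compares values and ignores which column they belong to."""
    gt = sorted(sorted(map(norm, r.values())) for r in ground_truth)
    pr = sorted(sorted(map(norm, r.values())) for r in pred)
    return gt == pr


def rows_match_keyed(ground_truth: List[Row], pred: List[Row]) -> bool:
    """Compares (column, value) pairs, ignoring row order and column order."""
    def canon(rows: List[Row]):
        return sorted(
            tuple(sorted((str(k), norm(v)) for k, v in r.items())) for r in rows
        )
    return canon(ground_truth) == canon(pred)


gt = [{"origin": "SFO", "destination": "JFK"}]
swapped = [{"origin": "JFK", "destination": "SFO"}]

print(rows_match_values_only(gt, swapped))  # True (false positive)
print(rows_match_keyed(gt, swapped))        # False
```

A keyed comparison is stricter about column names: a prediction that aliases a column differently from the ground truth will no longer match, which may or may not be the desired behaviour for this benchmark.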


cursor · 2025-11-23T17:30:18Z

eval_protocol/models.py

-    )
+    content: Optional[
+        Union[str, List[Union[ChatCompletionContentPartTextParam, ChatCompletionContentPartImageParam]]]
+    ] = Field(default="", description="The content of the message.")


Bug: Image content silently dropped by text extraction

The content field now accepts ChatCompletionContentPartImageParam, but existing code throughout the codebase (like _coerce_content_to_str functions in benchmark tests) only handles ChatCompletionContentPartTextParam by accessing the text attribute. When image parts are present, they're silently skipped since they have image_url instead of text, causing data loss without any error indication.
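A sketch of a text-coercion helper that makes image parts visible instead of dropping them. It assumes content parts are plain dicts in the OpenAI chat format ("type": "text" with a "text" key, "type": "image_url" with an "image_url" key); the function name, the strict flag, and the "[image]" placeholder are hypothetical, and the project's own _coerce_content_to_str helpers may access parts differently.

```python
from typing import Any, Dict, List, Optional, Union

ContentPart = Dict[str, Any]
Content = Optional[Union[str, List[ContentPart]]]


def coerce_content_to_str(content: Content, strict: bool = False) -> str:
    """Flatten message content to text without silently losing image parts.

    With strict=False each image part becomes an "[image]" placeholder;
    with strict=True an image part raises ValueError.
    """
    if content is None:
        return ""
    if isinstance(content, str):
        return content

    chunks: List[str] = []
    for part in content:
        kind = part.get("type")
        if kind == "text":
            chunks.append(part.get("text", ""))
        elif kind == "image_url":
            if strict:
                raise ValueError("image content part cannot be coerced to text")
            chunks.append("[image]")
        else:
            # Unknown part types are rejected rather than skipped.
            raise ValueError(f"unsupported content part type: {kind!r}")
    return "".join(chunks)


parts = [
    {"type": "text", "text": "Describe this chart: "},
    {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
]
print(coerce_content_to_str(parts))  # Describe this chart: [image]
```

Whether to substitute a placeholder or raise depends on the caller: a benchmark that scores only text may prefer the strict mode so that multimodal inputs fail loudly.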

cursor bot reviewed Nov 10, 2025


chatgpt-codex-connector bot reviewed Nov 10, 2025


Text to SQL RFT example

6026e78

benjibc force-pushed the text_to_sql_example branch from 2dbf46d to 6026e78 on November 23, 2025 17:28

cursor bot reviewed Nov 23, 2025



2 participants
