Skip to content

Add serialized_space schema and best practices to databricks-genie skill#473

Open
sean-zhang-dbx wants to merge 1 commit intodatabricks-solutions:mainfrom
sean-zhang-dbx:feature/genie-schema-best-practices
Open

Add serialized_space schema and best practices to databricks-genie skill#473
sean-zhang-dbx wants to merge 1 commit intodatabricks-solutions:mainfrom
sean-zhang-dbx:feature/genie-schema-best-practices

Conversation

@sean-zhang-dbx
Copy link
Copy Markdown

Summary

  • Adds references/schema.md: complete serialized_space JSON schema with field reference tables, formatting rules, prompt matching docs, join_specs format, and benchmark configuration
  • Adds references/best-practices.md: curated best practices covering instruction authoring, column configuration, benchmarks, a troubleshooting decision tree, and a 23-item validation checklist
  • Updates SKILL.md to link to the two new reference files

Context

The existing databricks-genie skill covers MCP tool usage and the Conversation API well, but has no documentation for the serialized_space JSON payload structure or deeper best practices around instruction authoring, prompt matching, benchmarks, and troubleshooting. This PR fills those gaps with content adapted from the prompt-to-genie project.

Test plan

  • Verified schema.md JSON example is valid and field reference tables are accurate
  • Verified best-practices.md covers instruction priority, SQL expressions, example SQL, text instructions, join specs, column config/prompt matching, benchmarks, troubleshooting, and validation checklist
  • Verified no external documentation URLs are referenced
  • Verified SKILL.md reference links resolve correctly

Add two reference documents that fill gaps in the existing Genie skill:

- references/schema.md: Complete serialized_space JSON schema including
  field reference tables, formatting rules (sql as string arrays, ID
  generation, sorting requirements), prompt matching (v1 vs v2 fields,
  API vs UI behavior), join_specs format (--rt= annotation), and
  benchmark configuration.

- references/best-practices.md: Curated best practices covering
  instruction priority (SQL expressions > example SQL > text),
  SQL expression authoring (measures/filters/dimensions), example SQL
  query formatting, parameterized queries, text instruction tips,
  column configuration and prompt matching, benchmark creation
  (core vs stretch), a troubleshooting decision tree for 8 common
  issues, and a 23-item validation checklist.

SKILL.md updated to link to the new reference files.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant