Replies: 1 comment
-
|
I'm definitely not against more rules to "thin the noise". I opened an Elastic Search PR (still waiting for their review), but they have you sign a CLA, and the linting enforces good patterns. I think if we had checks that could qualify a PR by adding/removing a "ready-for-review" tag (based on conflicts, resolved bot comments, passing CI, etc) I'd be into it. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Background
This is prompted by a few PRs I've seen recently after tagging an issue "good first issue" and by these two blog posts:
I am not proposing a ban on AI-generated code from new contributors. For one thing, the majority of top contributors to Superset seem to be using AI to write and review code and even to comment on GitHub issues. I write my code contributions with AI. But I think there is a phenomenon to discuss.
Recent Superset experiences
The blog posts note that we (Superset committers) can probably move the project faster if we write PRs ourselves rather than review and give feedback on PRs from first-time authors. The value proposition for us is that we're growing potential long-term contributors who will over time give more to the project than they require in review.
That's why I tag issues "good first issue" instead of just fixing them myself, to get new people involved (at a time cost to myself as I work with the new author).
It raises the question of what we expect from first-time PR authors. I have recently seen PRs that consist only of:
When a human generates the AI code they can still add value: manually testing the output via a local Superset deployment, working through edge cases, or adding human comments to an issue or PR. These authors did none of these.
I have twice now asked for screenshots and the contributors admitted they don't have a local instance of Superset running. To me that's unacceptable, to send code that you can't or won't even deploy yourself (except for maybe editing the docs, which was one of the cases). If setting up a dev environment is too big a hurdle then you should find another project to work on.
What to do about it
I would love to have some statement for potential new contributors that sets a certain bar. It's not about AI but it is only relevant in this era -- before AI you knew they had a certain amount invested already.
For starters I would say the expectations include:
pre-commitchecks passing before a committer reviewsIf your PR does not indicate you've done these things, it may be tagged as lacks-human-authorship and closed
Apache Airflow asks PR authors if they used AI to generate the code. At this point I assume any new code is AI generated so I actually don't think we need that.
Agree? Disagree? Other thoughts?
Beta Was this translation helpful? Give feedback.
All reactions