# SQLFluff - Contributing
:star2: **First** - thanks for being interested in improving SQLFluff! :smiley:
:star2: **Second** - please read and familiarise yourself with both the content
of this guide and also our [code of conduct](CODE_OF_CONDUCT.md).
:star2: **Third** - the best way to get started contributing is to use the
tool in anger and then to submit bugs and features through GitHub.
In particular, in helping to develop the parser, examples of queries
that do not parse as expected are especially helpful.
:star2: **Fourth** - making sure that our documentation is up-to-date and useful
for new users is really important. If you are a new user, you are in precisely
the best position to do this. Familiarise yourself with the tool (as per step
3 above) and familiarise yourself with the current documentation (live version
at [docs.sqlfluff.com](https://docs.sqlfluff.com) and the source can be found
in the [docs](./docs/) folder of the repo). Pull requests are always welcome
with documentation improvements. Keep in mind that there are linting checks in
place for good formatting, so keep an eye on the tests whenever submitting a PR.
We also have a [GitHub wiki](https://github.com/sqlfluff/sqlfluff/wiki) for
longer tutorials. We welcome
[contributions, suggestions or requests](https://github.com/sqlfluff/sqlfluff/issues/2104)
for the wiki.
:star2: **Fifth** - if you are so inclined - pull requests on the core codebase
are always welcome. Dialect additions are often a good entry point for new
contributors, and we have [a set of guides](https://docs.sqlfluff.com/en/stable/perma/guides.html)
to help you through your first contribution. Bear in mind that all the tests
should pass, and test coverage should not decrease unduly as part of the
changes which you make. You may find it useful to familiarise yourself with the
[architectural principles here](https://docs.sqlfluff.com/en/stable/perma/architecture.html)
and with the [current documentation here](https://docs.sqlfluff.com).
## How The Community Works
SQLFluff is maintained by a community of volunteers, which means we have a
few processes in place to allow everyone to contribute at a level that suits
them and at a time that suits them. These are not meant to be a way of restricting
development, but a way of allowing the community to agree what to focus on
and then effectively direct its focus toward that. Anyone can pipe up in these
discussions, and the more we hear from users the more we can build a tool
that is useful for the community.
- Large features for consideration will be organised into _Major Releases_.
These will usually include significant changes in functionality or backwards-incompatible
changes. As some of these features may require significant
coordination, discussion or development work, there is a process for each
major release to work out what features will fit into that release.
- Each major release will have its own GitHub issue. For example, the link
to the issue for [0.6.0 is here](https://github.com/sqlfluff/sqlfluff/issues/922).
- Features or issues are organised into a _shortlist_. During the initial
discussion for the release, each feature is vetted for enough clarity
that someone in the community can pick it up. Issues where we cannot
reach clarity will be pushed to the next release. Getting this clarity
is important before development work progresses so that we know that
larger changes are a) in line with the aims of the project and b)
effectively pre-approved changes so that there are not any surprises
when it comes to merging.
- Once we reach the deadline for closing the roadmap for a release the
focus on development work should be on those features.
- Small features and bug fixes (assuming no backward compatibility issues)
do not need to go through the same process of vetting and can be picked
up and merged at any time.
### Maintainers
A small group of people volunteer their time to maintain the project and
share the responsibility for responding to issues and reviewing any proposed
changes via pull requests. Each one of them will be trying to follow
the process above and keep development work on the project moving. That
means for smaller changes and improvements they may review changes as
individuals and merge them into the project in a very lightweight way.
For larger changes, especially if not already part of the current major
release process, the expectation is that they will involve other members
of the maintainer community or the project admins before merging in
larger changes or giving the green light to major structural project
changes.
## Nerdy Details
### Developing and Running SQLFluff Locally
#### Requirements
If you plan on working with a particular dbt plugin, you will need
to ensure your python version is high enough to support it. For example,
the instructions below use `python3.12`, and we support as low as `python3.9`.
The simplest way to set up a development environment is to use `tox`.
First ensure that you have tox installed:
```shell
python3.12 -m pip install -U tox
```
**IMPORTANT:** Python 3.9 is the minimum version we support. Feel free
to test on anything between `python3.9` and `python3.13`.
Note: Unfortunately tox does not currently support setting just a minimum
Python version (though this may be coming in tox 4!).
#### Creating a virtual environment
A virtual environment can then be created and activated. For the
various versions currently available you can check the `tox.ini` file.
The numbers correspond to the dbt core version; dbt180 will install
dbt 1.8.0.
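If you would rather not read `tox.ini` directly, tox itself can list the environments it defines (the exact command depends on your tox version):
```shell
# tox 4 and later:
tox list
# tox 3:
tox -a
```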
To build and activate the virtual environment:
```shell
tox -e dbt180 --devenv .venv
source .venv/bin/activate
```
(The `dbt180` environment is a good default choice.
However, any version can be installed by replacing `dbt180` with
`py`, `py39` through `py313`, `dbt140` through `dbt190`, etc.
`py` defaults to the python version that was used to install tox.
To be able to run all tests including the dbt templater,
choose one of the dbt environments.)
Windows users should call `.venv\Scripts\activate` rather than `source .venv/bin/activate`.
They may also want to substitute `winpy` for `py` in the commands above.
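Putting those two substitutions together, a sketch of the Windows flow (from a cmd.exe prompt, using the `winpy` environment mentioned above) would be:
```shell
REM Build the environment, then activate it (no `source` needed on Windows).
tox -e winpy --devenv .venv
.venv\Scripts\activate
```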
This virtual environment will already have the package installed in editable
mode for you, as well as `requirements_dev.txt` and `plugins/sqlfluff-plugin-example`.
Additionally, if a dbt virtual environment was specified, you will also have
`dbt-core`, `dbt-postgres`, and `plugins/sqlfluff-templater-dbt` available.
A different dbt plugin can be selected by changing the appropriate file under `constraints`
for the desired package and version.
#### Developing in Docker
To build a simple interactive Docker container, run the following commands:
```shell
make build
make shell
```
This container installs all Python dependencies, and mounts the project directory into
the container. It installs SQLFluff in editable mode. The nuts and bolts are in
place such that git should work seamlessly inside the container. It'll also install
the dbt templater plugin.
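Once inside the container you can use the normal development commands directly; for example (an illustrative quick check, not a required step):
```shell
# Inside the container, after `make shell`:
pytest -x test/core
```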
### Wiki
We have a [GitHub wiki](https://github.com/sqlfluff/sqlfluff/wiki) with some
more long form tutorials for contributors, particularly those new to SQLFluff
or contributing to open source. We welcome
[contributions, suggestions or requests](https://github.com/sqlfluff/sqlfluff/issues/2104)
for the wiki.
### Developing plugins
If you're working on plugins (like the dbt templater), you'll also need to install
those plugins in editable mode. This works the same way as the main project,
but you'll need to do each one explicitly, e.g.
```shell
pip install -e plugins/sqlfluff-templater-dbt/.
```
> NOTE: For packages intended to be installed like this, the source code must be directly
> within a subdirectory with the name of the package and not in a subdirectory such as
> src. This is due to a restriction in the implementation of setup.py in editable mode.
### Testing
To test locally, SQLFluff uses `tox` (check the [requirements](#requirements)!).
The test suite can be run via:
```shell
tox
```
This will build and test for several Python versions, and also lint the project.
Practically on a day-to-day basis, you might only want to lint and test for one
Python version, so you can always specify a particular environment. For example,
if you are developing in Python 3.9 you might call...
```shell
tox -e generate-fixture-yml,py39,linting,mypy
```
...or if you also want to see the coverage reporting...
```shell
tox -e generate-fixture-yml,cov-init,py39,cov-report,linting,mypy
```
> NB: The `cov-init` task clears the previous test results, the `py39` environment
> generates the results for tests in that Python version and the `cov-report`
> environment reports those results out to you (excluding dbt).
`tox` accepts `posargs` to allow you to refine your test run, which is much
faster while working on an issue, before running full tests at the end.
For example, you can run specific tests by making use of the `-k` option in `pytest`:
```
tox -e py39 -- -k AL02 test
```
Alternatively, you can also run tests from a specific directory or file only:
```
tox -e py39 -- test/cli
tox -e py39 -- test/cli/commands_test.py
```
You can also manually test your updated code against a SQL file via:
```shell
sqlfluff parse test.sql
```
(ensure your virtual environment is activated first).
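For example, a quick end-to-end check might look like the following (the file contents and the `ansi` dialect are purely illustrative):
```shell
# Create a small throwaway file to test against.
echo "SELECT a, b FROM tbl" > test.sql
# Inspect the parse tree produced by your updated code...
sqlfluff parse test.sql --dialect ansi
# ...or run the linter over the same file.
sqlfluff lint test.sql --dialect ansi
```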
#### How to use and understand the test suite
When developing for SQLFluff, you may not need (or wish) to run the whole test
suite, depending on what you are working on. Here are a couple of scenarios
for development, and which parts of the test suite you may find most useful.
1. For dialect improvements (i.e. changes to anything in [src/sqlfluff/dialects](./src/sqlfluff/dialects))
you should not need to continuously run the full core test suite. Running
either `tox -e generate-fixture-yml` (if using tox), or setting up a python
virtualenv and running `test/generate_parse_fixture_yml.py` directly will
usually be sufficient. Both of these options accept arguments to restrict
runs to specific dialects to further improve iteration speed. e.g.
- `tox -e generate-fixture-yml -- -d mysql` will run just the mysql tests.
- `python test/generate_parse_fixture_yml.py -d mysql` will do the same.
2. Developing for the dbt templater should only require running the dbt test
suite (see below).
3. When developing rules and rule plugins, there are a couple of scenarios.
- When developing a new rule or working with a more isolated rule, you
should only need to run the tests for that rule. These are usually what
are called the _yaml tests_. This refers to a body of example sql
statements and potential fixes defined in a large set of yaml files
found in [test/fixtures/rules/std_rule_cases](./test/fixtures/rules/std_rule_cases)
(a rough sketch of the file format is shown after this list).
The easiest way to run these is by calling that part of the suite
directly and filtering to just that rule. For example:
- `tox -e py39 -- test/rules/yaml_test_cases_test.py -k AL01`
- `pytest test/rules/yaml_test_cases_test.py -k AL01`
- When developing on some more complicated rules, or ones known to
have interactions with other rules, there is a set of rule-fixing
tests which apply a fixed combination of those rules. These are best
run via the `autofix` tests. For example:
- `tox -e py39 -- test/rules/std_fix_auto_test.py`
- `pytest test/rules/std_fix_auto_test.py`
- Potentially even the full rules suite `tox -e py39 -- test/rules`
- A small number of core rules are also used in making sure that inner
parts of SQLFluff are also functioning. This isn't great isolation
but does mean that occasionally you may find side effects of your
changes in the wider test suite. These can usually be caught by
running the full `tox -e py39` suite as a final check (or using the
test suite on GitHub when posting your PR).
4. When developing the internals of SQLFluff (i.e. anything not
already mentioned above), the test suite typically mirrors the structure
of the internal submodules of sqlfluff:
- When working with the CLI, the `sqlfluff.cli` module has a test suite
called via `tox -e py39 -- test/cli`.
- When working with the templaters (i.e. `sqlfluff.core.templaters`), the
corresponding test suite is found via `tox -e py39 -- test/core/templaters`.
- This is rough guidance, however, and may not apply to all of the internals.
For example, changes to the internals of the parsing module (`sqlfluff.core.parser`)
are very likely to have knock-on implications across the rest of the test
suite and it may be necessary to run the whole thing. In these
situations, however, you can usually work slowly outward, for example:
1. If your change is to the `AnyOf()` grammar, first running `tox -e py39 -- test/core/parser/grammar_test.py` would be wise.
2. ...followed by `tox -e py39 -- test/core/parser` once the above is passing.
3. ...and then `tox -e py39 -- test/core`.
4. ...and finally the full suite `tox -e py39`.
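As an illustration of the _yaml tests_ mentioned in scenario 3 above, each file in
[test/fixtures/rules/std_rule_cases](./test/fixtures/rules/std_rule_cases) roughly
follows the shape sketched below. The rule code, test names and SQL here are made up
for illustration; see the [Rules Test README](test/fixtures/rules/std_rule_cases/README.md)
for the authoritative format.
```yaml
rule: AL01  # The rule which all cases in this file exercise.

test_pass_explicit_alias:
  # A statement which should produce no violations for this rule.
  pass_str: SELECT col_a FROM tbl AS t

test_fail_implicit_alias:
  # A statement which should be flagged, plus the expected output of `sqlfluff fix`.
  fail_str: SELECT col_a FROM tbl t
  fix_str: SELECT col_a FROM tbl AS t
```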
#### dbt templater tests
The dbt templater tests require a locally running Postgres instance. See the
required connection parameters in `plugins/sqlfluff-templater-dbt/test/fixtures/dbt/profiles.yml`.
We recommend using https://postgresapp.com/.
To run the dbt-related tests you will have to explicitly include these tests:
```shell
tox -e cov-init,dbt180-py39,cov-report-dbt -- plugins/sqlfluff-templater-dbt
```
For more information on adding and running test cases see the [Parser Test README](test/fixtures/dialects/README.md) and the [Rules Test README](test/fixtures/rules/std_rule_cases/README.md).
#### Running dbt templater tests in Docker
The development Docker container has the dbt templater plugin installed,
so you can run the tests inside the container. Inside the container, run:
```
pytest -v plugins/sqlfluff-templater-dbt/test/
```
### Pre-Commit Config
For development convenience we also provide a `.pre-commit-config.yaml` file
to allow the user to install a selection of pre-commit hooks by running (check
the [requirements](#requirements) before running this):
```
tox -e pre-commit -- install
```
These hooks can help the user identify and fix potential linting/typing
violations prior to committing their code and therefore reduce having to deal
with these sorts of issues during code review.
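Once installed, the hooks run automatically when you commit. To run them on demand
across the whole repository, something like the following should work (assuming the
tox environment simply forwards its arguments to `pre-commit`):
```shell
tox -e pre-commit -- run --all-files
```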
### Documentation Website
Documentation is built using Sphinx with some pages being built based on the
source code. See the [Documentation Website README.md](./docs/README.md) file
for more information on how to build and test this.
### Building Package
New versions of SQLFluff will be published to PyPI automatically via
[GitHub Actions](.github/workflows/publish-release-to-pypi.yaml)
whenever a new release is published to GitHub.
#### Release checklist:
The [release page](https://github.com/sqlfluff/sqlfluff/releases) shows
maintainers all merges since the last release. Once we have a long enough list,
we should prepare a release.
A release PR can be created by maintainers via the
["Create release pull request" GitHub Action](https://github.com/sqlfluff/sqlfluff/actions/workflows/create-release-pull-request.yaml). Once this is done, it's a good idea to
put a short post in the #contributing channel on Slack so that people know
there will be a release soon.
As further PRs are merged, we may need to rerun the release script
(or alternatively just manually update the branch). This can only be rerun
locally (the GitHub Action will exit with an error if the branch already exists,
to prevent overwriting it).
Check out the release branch created by the GitHub Action locally and run
the script. It will preserve any `Highlights` you have added and update the
other sections with new contributions. It can be run as follows (you will
need a [GitHub Personal Access Token](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/creating-a-personal-access-token) with read & write permissions on the "Contents"
scope, and read permissions on the "Metadata" scope. The reason we need
both read & write access on the "Contents" scope is that only tokens with
write access can see _draft_ releases, which is what we need access to).
All maintainers should have sufficient access to generate such a token. Once
generated, store it in your env (zshrc, etc) as `SQLFLUFF_GITHUB_TOKEN`. Then:
```shell
make shell
make release 3.4.2 # Change to your release number
```
When all of the changes planned for the release have been merged and
the release PR is up to date, the maintainer running the release should
write a short commentary in the `CHANGELOG.md` file, describing any features
of note, and also celebrating new contributors. Check out old releases
for inspiration!
When the PR is ready, merge it. Once merged, follow these steps to actually
publish the final release.
1. Edit the draft release on GitHub. Copy across the summary from the
`CHANGELOG.md`
2. Update the title of the release to include the current date.
3. Check that the tag for the release is set to the release version. The tag
will not have been created yet, so click the option to create the tag on
release.
4. If this is an alpha release, make sure to set the "this is a pre-release"
option on the release. Otherwise leave it blank.
5. When you're ready, click "Publish Release" and that will set off the automated
processes to publish the release to PyPI and Docker Hub.
6. Finally, drop a note in #general on Slack to let the community know there's
a new release out.
Then you're done!