Codeberg was asking about this. The linked toot by a commenter points to :
These are CC-BY-SA 4.0 remixes of the Stack Exchange Creative Commons Data Dumps. 100% Unendorsed by Stack Exchange, Inc.
They are minimal. They provide the data you probably care about and the data you need to comply with the original license in SQLite format.
Because that’s the nature of FOSS. The good news is, if they trained on you data that’s licensed CC BY-SA (as all SO content is), then you can request their source code, and they legally must provide it.
This is a good thing.