I still want somebody to do a corpus that is essentially the Arctic construction but on Simple English Wikipedia because that would be a corpus that is strictly better than arctic
better yet, create a tool that does this for you for arbitrary amounts of desired data