Sharing data sets between chapters #16
Comments
I personally have no exposure to what people have been doing. I like the idea of coordinating on datasets and examples, but have taken no concrete steps in this direction. Perhaps this issue is such a step? If others are around, it might be interesting to list both the constraints each of us has on datasets for our sections and some datasets that we know about and appreciate. For example, for Dask we have the following constraints:
Datasets that we've frequently used in tutorials and examples include the following:
+1. For SciPy we are pretty flexible in terms of which datasets to use. We do need:
In our chapter (#26) we're using the measles incidence dataset that the Wall Street Journal highlighted a while back, along with some NYC taxi data, if anyone wants to use those.
From Debra's email: Matt Rocklin suggested using some data sets in common through the book, so feel free to coordinate with others on the project. The Dask chapter will also be written using the data and projects described in some of the other chapters.
@mrocklin do you have an overview of data sets already in use? For the SciPy chapter we'd be happy to reuse something as well.