Some photography: Paris in the morning
The Pantheon, in the morning haze, as seen from probabl's office in La Tour Montparnasse
(click through for highres)
https://www.flickr.com/photos/gaelvaroquaux/53971598638/in/dateposted-public/
The scikit-learn @sklearn library is used by millions. And so I was thrilled to meet with some of the people behind this project for this new #code4thought [EN] episode: Adrin @adrin@kolektiva.social, Gaël @GaelVaroquaux, Guillaume @glemaitre and Lucy. Listen to it on your podcast app or on YouTube podcast or directly on https://codeforthought.buzzsprout.com/1326658/14785355-en-scikit-learn-software-is-people
"mémoriser est une chose, savoir généraliser en est une autre. Par exemple, si je mémorise toutes les additions entre deux nombres plus petits que dix, je ne sais pas extrapoler au-delà. Pour aller plus loin, il faut que je maîtrise la logique de l'addition… ou que je mémorise plus." https://www.lesechos.fr/idees-debats/sciences-prospective/les-ia-raisonnent-elles-ou-recitent-elles-2103079 par @GaelVaroquaux
I finally found time for photography:
"Jazzy beats: Grandbrothers"
https://www.flickr.com/photos/gaelvaroquaux/53572808837/
I took this picture with my mobile phone, during a concert at La Cigale (click through for full picture / highres)
I ran a quick Gradient Boosted Trees vs Neural Nets check using scikit-learn's dev branch which makes it more convenient to work with tabular datasets with mixed numerical and categorical features data (e.g. the Adult Census dataset).
Let's start with the GBRT model. It's now possible to reproduce the SOTA number of this dataset in a few lines of code 2 s (CV included) on my laptop.
1/n
#sklearn #PyData #MachineLearning #TabularData #GradientBoosting #DeepLearning #Python
Software systems, more than any other engineering activity, create a technological world that results from social dynamics and constructs.
This is because the space of possibilities is much wider, and there are many more objects interacting than in other industrial endeavors.
Big thinkers of urban planning, designing spaces and cities accounting for interactions connected their thinking with sociology and related.
People thinking software at the ecosystem level probably should do the same.
Avec la #LoiImmigration, le gouvernement manie la xénophobie, et veut inscrire la discrimination dans la loi.
C'est le programme de l'extrême droite, un programme de division et non de construction, un programme qui met notre démocratie sur une pente dangereuse.
Une interview sur scikit-learn : la vision du projet, comment penser à l'impact, au lien avec la société, à la dynamique open-source... 45mn où je parle de ce qui nous motive, de ce que nous avons appris sur les données et l'humain...
https://www.youtube.com/watch?v=I5RoWUyJgT8
Ce fut un grand plaisir, merci beaucoup à l'équipe, hymaïa dont Yoann Benoit.
Je me rends compte que j'ai une meilleure énonciation en français 🙂
A thread from @GaelVaroquaux looking at the impact of the community-driven sklearn compared to centralized corporate ML packages. Community isn't always fast or easy, but it can be very robust over the long term once it's established.
"People underestimate how impactful @sklearn continues to be" — François Chollet
https://twitter.com/GaelVaroquaux/status/1734629067322753239