Category: Notes
-
The Jupyter+git problem is now solved · fast.ai
https://www.fast.ai/2022/08/25/jupyter-git/
-
GitHub – eugeneyan/applied-ml: 📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
https://github.com/eugeneyan/applied-ml
-
GitHub – JasonKessler/scattertext: Beautiful visualizations of how language differs among document types.
https://github.com/JasonKessler/scattertext
-
GitHub – axa-group/Parsr: Transforms PDF, Documents and Images into Enriched Structured Data
https://github.com/axa-group/Parsr
-
GitHub – gordicaleksa/stable_diffusion_playground: Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can generate and then later interpolate between the images of your choice.
https://github.com/gordicaleksa/stable_diffusion_playground
-
2 September, 2022 11:40
https://www-technologyreview-com.cdn.ampproject.org/c/s/www.technologyreview.com/2022/08/09/1057171/social-media-polluting-society-moderation-alone-wont-fix-the-problem/amp/
-
AutoRegex: Convert from English to RegEx with Natural Language Processing
https://www.autoregex.xyz/
-
André Staltz – Time Till Open Source Alternative
https://staltz.com/time-till-open-source-alternative.html
-
A Topic Modeling Comparison Between LDA, NMF, Top2Vec, and BERTopic to Demystify Twitter Posts
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9120935/
-
An approach for implementing and deploying Graph Deep Learning Models in production | by Cesar A. Charalla Olazo | Rappi Tech
https://engineering.rappi.com/an-approach-for-implementing-and-deploying-production-graph-deep-learning-models-ad52c6b7a481?gi=2867eb1b8db3
-
ML Education at Uber: Frameworks Inspired by Engineering Principles | Uber Blog
https://www.uber.com/en-PL/blog/ml-education-at-uber/
-
Top Automated Feature Engineering Frameworks in Python in 2022
https://moez-62905.medium.com/top-automated-feature-engineering-frameworks-in-python-in-2022-9899d7b18f7e
-
A Practical Guide to ARIMA Models using PyCaret, Part 3
https://towardsdatascience.com/a-practical-guide-to-arima-models-using-pycaret-part-3-823abb5359a7
-
GitHub – megvii-research/NAFNet: The state-of-the-art image restoration model without nonlinear activation functions.
https://github.com/megvii-research/NAFNet
-
TorchStudio IDE for PyTorch and its ecosystem
https://www.torchstudio.ai/
-
Automatic termination for hyperparameter optimization – Amazon Science
https://www.amazon.science/publications/automatic-termination-for-hyperparameter-optimization
-
Pegasus – Investigating Efficiently Extending Transformers for Long Input Summarization
https://github.com/google-research/pegasus/tree/main/pegasus/flax
-
15 August, 2022 13:08
https://towardsdatascience.com/understanding-arima-models-using-pycarets-time-series-module-part-1-692e10ca02f2
-
GitHub – nnaisense/evotorch: EvoTorch is an advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.
https://github.com/nnaisense/evotorch
-
10 August, 2022 23:21
https://www.freecodecamp.org/news/transform-machine-learning-models-into-native-code-with-zero-dependencies/
-
Understand BLOOM, the Largest Open-Access AI, and Run It on Your Local Computer | by Cristian Arteaga | Aug, 2022 | Towards Data Science
https://towardsdatascience.com/run-bloom-the-largest-open-access-ai-model-on-your-desktop-computer-f48e1e2a9a32
-
Home – KServe Documentation Website
https://kserve.github.io/website/0.9/
-
Amazon wins best-paper award at first AutoML conference – Amazon Science
https://www-amazon-science.cdn.ampproject.org/c/s/www.amazon.science/blog/amazon-wins-best-paper-award-at-first-automl-conference?_amp=true
-
3 August, 2022 10:24
https://huggingface.co/blog/nystromformer
-
The Need for a Kubernetes Alternative
https://dzone.com/articles/the-need-for-a-kubernetes-alternative
-
Accelerate Sentence Transformers with Hugging Face Optimum
https://www.philschmid.de/optimize-sentence-transformers
-
Big dataset of images from Instagram – 1,211,625 posts
https://www.kaggle.com/datasets/shmalex/instagram-images
-
Integrating B2C feature of Microsoft identity platform with a Python web application
ms-identity-python-webapp/README_B2C.md at master · Azure-Samples/ms-identity-python-webapp (github.com)
-
4 Pandas Anti-Patterns to Avoid and How to Fix Them
https://www.aidancooper.co.uk/pandas-anti-patterns/
-
GloVe Emoji Embeddings Emoji2Vec (pretrained) | Kaggle
https://www.kaggle.com/datasets/gowrishankarp/glove-emoji-embeddings-emoji2vec-pretrained
-
Introducing Theseus, a library for encoding domain knowledge in end to end AI models
https://ai.facebook.com/blog/theseus-a-library-for-encoding-domain-knowledge-in-end-to-end-ai-models/?utm_source=linkedin&utm_medium=organic_social&utm_campaign=blog
-
Using A Large Language Model For Entity Extraction | by Cobus Greyling | Jul, 2022 | Medium
https://cobusgreyling.medium.com/using-a-large-language-model-for-entity-extraction-6fffb988eb15
-
Scalable Efficient Big Data Pipeline Architecture | Towards Data Science
https://towardsdatascience.com/scalable-efficient-big-data-analytics-machine-learning-pipeline-architecture-on-cloud-4d59efc092b5
-
Hugging Face Diffusers Explained | Towards Data Science
https://towardsdatascience.com/hugging-face-just-released-the-diffusers-library-846f32845e65
-
How to enhance Google Search Console data exports with Streamlit
https://blog.streamlit.io/how-to-enhance-google-search-console-data-exports-with-streamlit/
-
How to Design the Most Powerful Graph Neural Network | Towards Data Science
https://towardsdatascience.com/how-to-design-the-most-powerful-graph-neural-network-3d18b07a6e66
-
Supporting Arbitrary ML Models with MLflow (Using PyCaret as an Example)
https://medium.com/dkatalis/supporting-arbitrary-ml-models-with-mlflow-using-pycaret-as-an-example-a26fa1b3ac38
-
Faster Text Generation with TensorFlow and XLA
https://huggingface.co/blog/tf-xla-generate
-
6 Hierarchical Data Visualizations | by Kruthi Krishnappa | Jul, 2022 | Towards Data Science
https://towardsdatascience.com/6-hierarchical-datavisualizations-98318851c7c5
-
Yann LeCun has a bold new vision for the future of AI
https://www.technologyreview.com/2022/06/24/1054817/yann-lecun-bold-new-vision-future-ai-deep-learning-meta/
-
GitHub – Lucaterre/spacyfishing: A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
https://github.com/Lucaterre/spacyfishing
-
LINUX Commands – XMind – Mind Mapping Software
https://www.xmind.net/m/WwtB/
-
Include diagrams in your Markdown files with Mermaid
https://github.blog/2022-02-14-include-diagrams-markdown-files-mermaid/
-
BLOOM
https://bigscience.huggingface.co/blog/bloom
-
Using Neo4j Graph Data Science in Python to Improve Machine Learning Models | by Tomaz Bratanic | Neo4j Developer Blog | Jul, 2022 | Medium
https://medium.com/neo4j/using-neo4j-graph-data-science-in-python-to-improve-machine-learning-models-c55a4e15f530
-
GitHub – jessevig/bertviz: BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
https://github.com/jessevig/bertviz#-documentation
-
Kedro vs ZenML vs Metaflow: Which Pipeline Orchestration Tool Should You Choose? – neptune.ai
https://neptune.ai/blog/kedro-vs-zenml-vs-metaflow