Catégorie : Notes
-
Tensorflow examples and tutorials
Tensorflow.pdf
-
GitHub – ritchieng/the-incredible-pytorch: The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
https://github.com/ritchieng/the-incredible-pytorch
-
The Most Complete Guide to PyTorch for Data Scientists | by Rahul Agarwal | Sep, 2020 | Towards Data Science
https://towardsdatascience.com/minimal-pytorch-subset-for-deep-learning-for-data-scientists-8ccbd1ccba6b
-
This know-it-all AI learns by reading the entire web nonstop | MIT Technology Review
https://www.technologyreview.com/2020/09/04/1008156/knowledge-graph-ai-reads-web-machine-learning-natural-language-processing/
-
Microsoft Offers New Documentation for Blazor and gRPC in ASP.NET Core — Visual Studio Magazine
https://visualstudiomagazine.com/articles/2020/09/03/blazor-grpc-docs.aspx?m=1
-
GitHub – neomatrix369/nlp_profiler: A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular sta…
https://github.com/neomatrix369/nlp_profiler
-
Microsoft Offers New Documentation for Blazor and gRPC in ASP.NET Core — Visual Studio Magazine
https://visualstudiomagazine.com/articles/2020/09/03/blazor-grpc-docs.aspx?m=1
-
Elastic Transformers
Making BERT stretchy Scalable Semantic Search on a Jupyter Notebook https://medium.com/@mihail.dungarov/elastic-transformers-ae011e8f5b88 https://github.com/md-experiments/elastic_transformers
-
Natural Language Processing: Intelligent Search through text using Spacy and Python | by Akash Chauhan | Aug, 2020 | Towards Data Science
https://towardsdatascience.com/natural-language-processing-document-search-using-spacy-and-python-820acdf604af
-
GitHub – neomatrix369/nlp_profiler: A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular sta…
https://github.com/neomatrix369/nlp_profiler
-
GitHub – alirezamika/autoscraper: A Smart, Automatic, Fast and Lightweight Web Scraper for Python
https://github.com/alirezamika/autoscraper
-
6 September, 2020 22:21
https://medium.com/@st3llasia/analyzing-arxiv-data-using-neo4j-part-1-ccce072a2027
-
GitHub – eugeneyan/applied-ml: 📚 Papers & articles of companies sharing their work on a pplied data science & machine learning.
https://github.com/eugeneyan/applied-ml
-
6 September, 2020 08:08
https://medium.com/dataseries/facebooks-pygraph-is-an-open-source-framework-for-capturing-knowledge-in-large-graphs-b52c0fb902e8
-
5 September, 2020 08:42
https://towardsdatascience.com/youre-living-in-1985-if-you-don-t-use-docker-for-your-data-science-projects-858264db0082
-
OpenAI reveals the pricing plans for its API — and it ain’t cheap
https://thenextweb.com/neural/2020/09/03/openai-reveals-the-pricing-plans-for-its-api-and-it-aint-cheap/
-
Stop One-Hot Encoding Your Categorical Variables.
https://towardsdatascience.com/stop-one-hot-encoding-your-categorical-variables-bbb0fba89809 Stop One-Hot Encoding Your Categorical Variables. | by Andre Ye | Aug, 2020 | Towards Data Science One-hot encoding, otherwise known as dummy variables, is a method of converting categorical variables into several binary columns, where a 1 indicates the presence of that row belonging to that… towardsdatascience.com
-
7 Women in Data Science You Should Be Following on LinkedIn
https://towardsdatascience.com/7-women-you-should-be-following-on-linkedin-737362a7777f 7 Women in Data Science You Should Be Following on LinkedIn | by Kurtis Pykes | Aug, 2020 | Towards Data Science Preceding this post I shared 8 Folks You Should Be Following On LinkedIn. As I reached the ending of the post I realized “Wait a minute… I have not made a single…
-
Scattertext 0.0.2.67
https://github.com/RasaHQ/whatlies
-
How to choose a cloud machine learning platform
12 capabilities every cloud machine learning platform should provide to support the complete machine learning lifecycle https://www.infoworld.com/article/3568889/how-to-choose-a-cloud-machine-learning-platform.html
-
BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
https://github.com/cliang1453/BOND cliang1453/BOND BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision – cliang1453/BOND github.com
-
Plotly Python Open Source Graphing Library
https://plotly.com/python/#ai_ml Plotly Python Graphing Library | Python | Plotly Interactive Data Analysis with FigureWidget ipywidgets. View Tutorial. Click Events plotly.com
-
From Analytics to Data Storytelling: An Advanced Leap of Data-Driven Industry
https://www.analyticsinsight.net/analytics-data-storytelling-advanced-leap-data-driven-industry/?suid=SU00031&medium=li&cmp=401
-
Scattertext 0.0.2.67
https://github.com/JasonKessler/scattertext GitHub – JasonKessler/scattertext: Beautiful visualizations of how language differs among document types. Using Scattertext as a text analysis library: finding characteristic terms and their associations. The following code creates a stand-alone HTML file that analyzes words used by Democrats and Republicans in the 2012 party conventions, and outputs some notable term associations. github.com
-
21 August, 2020 07:53
https://www.oreilly.com/radar/why-best-of-breed-is-a-better-choice-than-all-in-one-platforms-for-data-science/
-
The NLP Model Forge – Unlocking Inference for 1,400 NLP Models
https://medium.com/towards-artificial-intelligence/the-nlp-model-forge-a46faac7b5b0 The NLP Model Forge Unlocking Inference for 1,400 NLP Models medium.com
-
Announcing the Consortium for Python Data API Standards
https://data-apis.org/blog/announcing_the_consortium/
-
Machine Learning University: Accelerated Natural Language Processing Class
https://github.com/aws-samples/aws-machine-learning-university-accelerated-nlp
-
Announcing Neo4j Aura on Google Cloud Platform
https://neo4j.com/blog/announcing-neo4j-aura-on-google-cloud-platform/ Announcing Neo4j Aura on Google Cloud Platform Today, Neo4j is proud to announce the general availability of Neo4j Aura™ on Google Cloud Platform (GCP).Neo4j Aura is the first and only integrated graph database service on GCP. If you haven’t seen it yet, Aura is the simplest way to run Neo4j in the cloud. Completely…
-
DeText: A Deep Neural Text Understanding Framework
DeText is a Deep Text understanding framework for NLP related ranking, classification, and language generation tasks. It leverages semantic matching using deep neural networks to understand member intents in search and recommender systems. As a general NLP framework, currently DeText can be applied to many tasks, including search & recommendation ranking, multi-class classification and query…
-
FastAPI – NLP as a Service
Project Insight is designed to create NLP as a service with code base for both front end GUI (streamlit) and backend server (FastApi) the usage of transformers models on various downstream NLP task. The downstream NLP tasks covered: News Classification Entity Recognition Sentiment Analysis Summarization Information Extraction To Do The user can select different models…
-
Topic Modeling with Gensim
https://www.machinelearningplus.com/nlp/topic-modeling-gensim-python/ Gensim Topic Modeling – A Guide to Building Best LDA models – Machine Learning Plus Topic Modeling is a technique to understand and extract the hidden topics from large volumes of text. Latent Dirichlet Allocation(LDA) is an algorithm for topic modeling, which has excellent implementations in the Python’s Gensim package. This tutorial tackles the…
-
18 August, 2020 09:23
https://www.machinelearningplus.com/nlp/topic-modeling-python-sklearn-examples/
-
Zero-Shot Learning in Modern NLP
https://joeddav.github.io/blog/2020/05/29/ZSL.html Zero-Shot Learning in Modern NLP | Joe Davison Blog A latent embedding approach. A common approach to zero shot learning in the computer vision setting is to use an existing featurizer to embed an image and any possible class names into their corresponding latent representations (e.g. Socher et al. 2013).They can then take some…
-
Create Beautiful Interactive Visualisations in Python | by Rebecca Vickery | Aug, 2020 | Towards Data Science
https://towardsdatascience.com/create-beautiful-interactive-visualisations-in-python-f8517dc7ae5c
-
GitHub – sacmehta/delight: DeLighT: Very Deep and Light-Weight Transformers
https://github.com/sacmehta/delight
-
10 Must-Try Open Source Tools for Machine Learning | by Oleksii Kharkovyna | Towards Data Science
https://towardsdatascience.com/10-must-try-open-source-tools-for-machine-learning-1c4420ef40df
-
https://medium.com/@yassine.hamdaoui/text-classification-using-transformers-pytorch-implementation-5ff9f21bd106
https://medium.com/@yassine.hamdaoui/text-classification-using-transformers-pytorch-implementation-5ff9f21bd106
-
From stock market email newsletter side project to micro SaaS
https://bullish.email/blog/building-a-micro-saas-with-mailerlite-netlify-stripe-and-zapier/ From stock market email newsletter side project to micro SaaS A couple of months back, during this crazy world pandemic, I had an idea for a Stock Market email newsletter. bullish.email
-
GitHub – ivan-bilan/The-NLP-Pandect: A comprehensive reference for all topics related to Natural Language Processing
https://github.com/ivan-bilan/The-NLP-Pandect
-
GitHub – renatoviolin/Multiple-Choice-Question-Generation-T5-and-Text2Text: Question Generation using Google T5 and Text2Text
https://github.com/renatoviolin/Multiple-Choice-Question-Generation-T5-and-Text2Text
-
2007.05558 The Computational Limits of Deep Learning
https://arxiv.org/abs/2007.05558
-
GitHub – neuml/txtai: AI-powered search engine
https://github.com/neuml/txtai
-
Libra: Ergonomic machine learning
https://libradocs.github.io/ About Libra Libra is the nexus of modern machine learning. We’ve combined technologies from the most popular platforms to create a complete experience. Keras: straightforward model building techniques for improved modularity and ease of deployment. TensorFlow: core computational fundamentals and detailed functionality. PyTorch: scalable training for highly-dimensional processes. libradocs.github.io
-
The Roots of Data Science
https://towardsdatascience.com/the-roots-of-data-science-77c71115229 The Roots of Data Science. How it all began | by Favio Vázquez | Aug, 2020 | Towards Data Science John Tukey is one of the most important statisticians in history. In the fantastic article “The Future of Data Analysis” he said this: For a long time I have thought I was a statistician,…
-
Vaex: Out of Core Dataframes for Python and Fast Visualization | by Maarten Breddels | Towards Data Science
https://towardsdatascience.com/vaex-out-of-core-dataframes-for-python-and-fast-visualization-12c102db044a
-
Google breaks AI performance records in MLPerf with world’s fastest training supercomputer
https://cloud.google.com/blog/products/ai-machine-learning/google-breaks-ai-performance-records-in-mlperf-with-worlds-fastest-training-supercomputer Google wins MLPerf benchmark contest with fastest ML training supercomputer | Google Cloud Blog Table 1: All of these MLPerf submissions trained from scratch in 33 seconds or faster on Google’s new ML supercomputer. 2. Training at scale with TensorFlow, JAX, Lingvo, and XLA. Training complex ML models using thousands of TPU chips required…
-
How to identify the right independent variables for Machine Learning Supervised Algorithms? | by Kaushik Choudhury | Aug, 2020 | Towards Data Science
https://towardsdatascience.com/how-to-identify-the-right-independent-variables-for-machine-learning-supervised-algorithms-439986562d32