|Spark and Docker: Your Spark development cycle just got 10x faster ! – Data Mechanics Blog
Native support for Docker is in fact one of the main reasons companies choose to deploy Spark on top of Kubernetes instead of YARN. In this article, we will illustrate the benefits of Docker for Apache Spark by going through the end-to-end development cycle used by many of our users at Data Mechanics.
|How to Build a Scalable Data Analytics Pipeline
As the data keeps growing in volume the data analytics pipelines have to be scalable to adapt the rate of change. And for this reason, choosing to set up a the pipeline in cloud makes perfect sense (since cloud offers on-demand scalability and flexibility). In this article I will demystify how to build a scalable and adaptable data processing pipeline in Google Cloud.
|ML Deployment Decision Tree
Choose the right tool for your job
|Topic modeling visualization – How to present results of LDA model? | ML+ – Machine Learning Plus
In this post, we discuss techniques to visualize the output and results from topic model (LDA) based on the gensim package.. Topic modeling visualization – How to present the results of LDA models?
|GitHub CLI 1.0 is now available – The GitHub Blog
GitHub CLI brings GitHub to your terminal. It reduces context switching, helps you focus, and enables you to more easily script and create your own workflows. Earlier this year, we announced the beta of GitHub
|GitHub – JRC1995/Abstractive-Summarization: Implementation of abstractive summarization using LSTM in the encoder-decoder architecture with local attention.
Implementation of abstractive summarization using LSTM in the encoder-decoder architecture with local attention. – JRC1995/Abstractive-Summarization