Skip to content

PacktPublishing/Developing-Kaggle-Notebooks

Repository files navigation

Developing-Kaggle-Notebooks

This is the code repository for the book Developing Kaggle Notebooks: Paving your way to becoming a Kaggle Notebooks Grandmaster, published by Packt.

What is this book about?

Developing Kaggle Notebooks is here to introduce you to the wide world of data analysis, with a focus on using Kaggle Notebooks resources to help you achieve mastery in this field as well as rising to the top in the Kaggle Notebooks tier. The book is structured as a seven-step trip into the world of analysis, exploring the features available in Kaggle Notebooks alongside various data analysis techniques and different kinds of datasets.

What you will learn

  • Approach a new dataset or competition to perform a data analysis via a Notebook and get noticed
  • Start exploring a new source of data, from tools to use for ingestion to treating various issues with ingested data
  • Structure your code using reusable components
  • Perform a deep dive for both small and large datasets of various types
  • Differentiate yourself from the crowd with the content of your analysis
  • Improve the style of your Notebook: color scheme, content organization, visual effects, and theme
  • Use storytelling techniques to captivate your audience, improve the clarity of the presentation, and raise its impact

Who This Book Is for?

This book is suitable for a wide audience with a keen interest in data science and machine learning and those who want to use Kaggle Notebooks to improve their skills and rise in the Kaggle Notebooks ranks. Beginners on Kaggle from any background will benefit Seasoned contributors who want to improve various skills like ingestion, preparation, exploration, and visualization Expert contributors who would like to learn from the Grandmasters to rise into the upper Kaggle rankings Professionals who already use Kaggle for learning and competing

Notebooks in each chapter

The following are links for the notebooks associated with each chapter. The first column in the table gives the chapter. The 2nd column gives the link to the resource (notebook or utility script) in the book repository. The 3rd column gives the link to the resource on Kaggle. By following the link for a notebook on Kaggle, you can directly fork the notebook and start using it directly on the platform.

Chapter no.Chapter titleNotebook/Utility Script (GitHub)Open it on Kaggle
3Starting Our Travel – Surviving the Titanic DisasterTitanic - Start of a Journey Around Data WorldKaggle
4Take a Break and Have a Beer or Coffee in LondonCoffee or Beer in London - Your Choice!Kaggle
Every Pub in England - Data ExplorationKaggle
geospatial_utilsKaggle
plot_style_utilsKaggle
Starbucks Location Worldwide - Data ExplorationKaggle
5Get Back to Work and Optimize Microloans for Developing CountriesKiva Microloans - A Data ExplorationKaggle
Understand-poverty-to-optimize-microloansKaggle
6Can You Predict Bee Subspecies?data_quality_statsKaggle
Honeybee Subspecies ClassificationKaggle
plotly_utilsKaggle
7Text Analysis Is All You NeedJigsaw Comments Text ExplorationKaggle
Jigsaw Simple BaselineKaggle
8Analyzing Acoustic Signals to Predict the Next Simulated EarthquakeJLANL Earthquake Data Exploration and BaselineKaggle
9Can You Find Out Which Movie Is a Deepfake?DeepFake Exploratory Data AnalysisKaggle
face_detection_mtcnnKaggle
face_object_detectionKaggle
video_utilsKaggle
10Unleash the Power of Generative AI with Kaggle ModelsRAG using Llama 2, Langchain and ChromaDBKaggle
Simple sequential chain with Llama 2 and LangchainKaggle
Test Llama 2 quantized with Llama.cppKaggle
Test LlaMa v2 with mathKaggle
Test of Llama 2 quantized with llama.cpp (on CPU)Kaggle
Use Code Llama to generate Python code (13b)Kaggle
11Closing Our Journey: How to Stay Relevant and on TopCoffee or Beer in London - Your Choice (improved)!Kaggle
How active are the users on Kaggle?Kaggle

Know more on the Discord server Coding

You can get more engaged on the discord server for more latest updates and discussions in the community at Discord

Download a free PDF Coding

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost. Simply click on the link to claim your free PDF.Free-EbookCoding

We also provide a PDF file that has color images of the screenshots/diagrams used in this book at GraphicBundleCoding

Get to know the Authors

Dr. Gabriel Preda is a Principal Data Scientist for Endava, a major software services company. He has worked on projects in various industries, including financial services, banking, portfolio management, telecom, and healthcare, developing machine learning solutions for various business problems, including risk prediction, churn analysis, anomaly detection, task recommendations, and document information extraction. In addition, he is very active in competitive machine learning, currently holding the title of a three-time Kaggle Grandmaster and is well-known for his Kaggle Notebooks.

Other Related Books

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
close