<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>PySpark on Shubham | Full Stack &amp; AI Engineer</title><link>https://w3shubh.com/tags/pyspark/</link><description>Recent content in PySpark on Shubham | Full Stack &amp; AI Engineer</description><generator>Hugo</generator><language>en</language><lastBuildDate>Tue, 15 Apr 2025 00:00:00 +0530</lastBuildDate><atom:link href="https://w3shubh.com/tags/pyspark/index.xml" rel="self" type="application/rss+xml"/><item><title>6sense: From Hackathons to AI Pipelines</title><link>https://w3shubh.com/posts/pandora/</link><pubDate>Tue, 15 Apr 2025 00:00:00 +0530</pubDate><guid>https://w3shubh.com/posts/pandora/</guid><description>A 4.5-year journey scaling frontend architectures, merging acquisitions, building big data pipelines, and shipping Generative AI infrastructure.</description></item><item><title>Big Data Engineering: Optimizing PySpark Pipelines at 6sense</title><link>https://w3shubh.com/posts/big-data-pipelines/</link><pubDate>Sun, 01 Oct 2023 00:00:00 +0530</pubDate><guid>https://w3shubh.com/posts/big-data-pipelines/</guid><description>Owned critical big data infrastructure handling millions of signals daily. Optimized a core PySpark DAG saving 45 mins/day and retired legacy systems to save $130K annually.</description></item></channel></rss>