Data Engineering career imposter syndrome and skill gaps
“I'm FAR from those godly DEs who have every call possible in spark memorized while they eat their daily masters degree in a framework which came out tomorrow.”
Real frustrations surfaced from 81 posts across Reddit, X, and Hacker News. Week of May 18–24 2026.
Data Engineering career imposter syndrome and skill gaps
“I'm FAR from those godly DEs who have every call possible in spark memorized while they eat their daily masters degree in a framework which came out tomorrow.”
Stakeholders ignoring dashboards for Excel exports
“The executive team to log in, ignore every carefully crafted chart, and immediately hunt for the "Export to CSV" button.”
Mismatch between academic analytics and corporate reality
“Most of analytics is figuring out which version of "the truth" your stakeholders are asking about. Same metric, three definitions, three teams arguing about it.”
Unmanaged technical debt from 'Self-Service' analytics
“It seems to lead to a massive sprawl of 50+ page dashboards where 90% of the tabs are just slightly different filters of the same broken logic.”
GenAI FOMO and counter-productive AI Agents
“Our team is building unnecessary agents and agentic tools and enforcing users and dependent business teams to use the tool, so basically we are building agentic tools which nobody's gonna use.”
Difficulty learning CI/CD and IaC in data roles
“My company uses AWS and GitLab, and we don't have many permissions to deploy much manually through the management console, everything has to go through CloudFormation and CI/CD pipelines. It's quite overwhelming.”
Power BI model maintenance and performance bloat
“The model has become quite messy — multiple fact tables, bidirectional filters everywhere, unclear relationships, etc.”
Lack of standardized analytics pipeline formats
“I’m quite confused (and probably naive) as to why there isn’t a seriously structured & comprehensive pipeline format that most/all data analysts use when selecting/executing their potential models.”
Manual extraction from unstructured PDFs and Web
“It is 6 hours of copy paste and we still get typos that break dashboards. The PDFs are all different formats.”
Experienced Data Engineers struggling to find new roles
“I’ve got about 12 years in data, with maybe 5 to 6 of those being data engineering... I’m getting interviews but not landing.”
Inconsistent compensation and HR data across systems
“Compensation data lives in three different systems that don't talk to each other. HR refuses to give direct access to payroll exports.”
Neglected metadata and column-level documentation
“Finding a table/column in a database can sometimes take hours. The fundamental problem is that... nobody properly maintains metadata and documentations.”
Tableau's perceived decline in market relevance
“I am seeing some veteran tableau users move away from the platform, but also firms moving away and fewer and fewer data analyst roles in the market.”
Legacy logic and clunkiness of SAS
“It's idiosyncratic with data types and missing value logic, and its Proc SQL capability is inefficient and lacking in contemporary basics like window functions.”
Power BI CPU consumption limits reached
“We have a model that brought our capacity to its limits. 100% of a P3 capacity with 2-3 queries.”
Difficulty attributing AI-driven referral traffic
“Only a small fraction of citations result in clicks... Without impression data (which AI platforms don't expose), the estimate seems very uncertain.”
Complexity of distilling IoT/Time-Series into metrics
“How does the world of data engineers continually distill real world data into valuable metrics?”
Ragged hierarchy and sorting bugs in visuals
“Numbers 0 and 5 in between the numbers, arent they supposed to be at the beginning? This thing is single-handedly ruining my entire day.”
Normalization dilemma for 1-to-1 relationships
“Should i flatten the data or to what extent... Intuitively it seems flattening all tables that have 1 to 1 relation with the fact table to avoid joins.”
Managing watermarks in complex ELT pipelines
“Running into what feels more like a control-plane/orchestration problem... delta watermarks updated only after Snowflake/dbt commands complete.”
Reddinbox tracks Reddit, X, YouTube and more in real time — sending you alerts the moment your audience starts talking about the problems your product solves.
No credit card required · Cancel anytime