Open in app

Sign In

Write

Sign In

Deniz Parmaksız
Deniz Parmaksız

589 Followers

Home

About

Published in

Insider Engineering

·1 day ago

Building a Feature Store with Apache Iceberg on AWS

Insider has been leveraging AI/ML since the very early years of the company to differentiate in the MarTech field. …

Feature Store

4 min read

Building a Feature Store with Apache Iceberg on AWS
Building a Feature Store with Apache Iceberg on AWS
Feature Store

4 min read


Published in

Insider Engineering

·Sep 11

Realtime Alerting from Applications Logs in Amazon CloudWatch

Monitoring the health and performance is critical for any application and the business they are supporting. In case of an SLA breach or degradation, the application team must receive an immediate alert to investigate and resolve the issue. Our data and machine learning teams are running applications using a wide…

AWS

4 min read

Realtime Alerting from Applications Logs in Amazon CloudWatch
Realtime Alerting from Applications Logs in Amazon CloudWatch
AWS

4 min read


Published in

Insider Engineering

·Jun 21

Deploying AWS Lambda Functions for Machine Learning Workloads

AWS Lambda is a serverless and event-driven computing service in Amazon’s cloud computing offerings. Lambda functions enable organizations to run application code without managing servers and can automatically scale to hundreds of thousands of executions, within account limits. Lambda functions can be deployed in two different ways; using a zip…

AWS Lambda

4 min read

Deploying AWS Lambda Functions for Machine Learning Workloads
Deploying AWS Lambda Functions for Machine Learning Workloads
AWS Lambda

4 min read


Published in

Insider Engineering

·Apr 4

Sync Autotuner Reduced Our EMR Cost by 25%

Amazon EMR is our go-to solution to run Apache Spark for data processing, interactive analytics, and machine learning on AWS. As a fast-growing tech company, we rely on Amazon EMR to run over 10,000 Spark applications every day to fuel our products with up-to-date data and predictions. However, over time…

AWS

4 min read

Sync Autotuner Reduced Our EMR Cost by 25%
Sync Autotuner Reduced Our EMR Cost by 25%
AWS

4 min read


Published in

Insider Engineering

·Oct 5, 2022

How We Migrated Our Data Lake to Apache Iceberg

We recently migrated our production data lake containing tens of terabytes of data in Amazon S3 from Apache Hive to Apache Iceberg and achieved a 90% cost saving for Amazon S3. If you are interested in the cost analysis and its reasons, you should also read our other post…

Apache Iceberg

5 min read

How We Migrated Our Data Lake to Apache Iceberg
How We Migrated Our Data Lake to Apache Iceberg
Apache Iceberg

5 min read


Published in

Insider Engineering

·Sep 28, 2022

Apache Iceberg Reduced Our Amazon S3 Cost by 90%

The new generation data lake table formats (Apache Hudi, Apache Iceberg, and Delta Lake) are getting more traction every day with their superior capabilities compared to Apache Hive. They enable cost-effective cloud solutions for big data analysis with ACID transactions, schema evolution, time travel, and more. Table Formats Table format technology is…

Apache Iceberg

5 min read

Apache Iceberg Reduced Our Amazon S3 Cost by 90%
Apache Iceberg Reduced Our Amazon S3 Cost by 90%
Apache Iceberg

5 min read


Published in

Insider Engineering

·Mar 28, 2022

Triggering Airflow Workflows After Data Modification in Amazon Aurora

Amazon Aurora is a great choice for scalable, available, reliable, and cost-effective MySQL and PostgreSQL compatible RDBMS solutions that we heavily use at Insider. It enables us to focus more on product development while the managed service handles time-consuming tasks like hardware provisioning, database setup, patching, and backups. …

Amazon Aurora

3 min read

Triggering Airflow Workflows After Data Modification in Amazon Aurora
Triggering Airflow Workflows After Data Modification in Amazon Aurora
Amazon Aurora

3 min read


Published in

Insider Engineering

·Feb 28, 2022

Scaling ML Model Serving on Amazon EKS with Custom Metrics

Serving machine learning models for real-time prediction is a hot topic and there are numerous solutions out there. The easiest one is using a fully managed service like Amazon Sagemaker that handles all the operational burden of the deployments and scaling for you. Other than that, there are model serving…

Kubernetes

5 min read

Scaling ML Model Serving on Amazon EKS with Custom Metrics
Scaling ML Model Serving on Amazon EKS with Custom Metrics
Kubernetes

5 min read


Published in

Insider Engineering

·Jan 31, 2022

Benchmarking Amazon EMR vs Databricks

At Insider, we use Apache Spark as the primary data processing engine to mine our clients’ clickstream data and feed ML-ready data into our machine learning pipelines to enable personalizations. We have been using Spark since version 1.5 and always looking for ways to improve efficiency. If you are interested…

Apache Spark

8 min read

Benchmarking Amazon EMR vs Databricks
Benchmarking Amazon EMR vs Databricks
Apache Spark

8 min read


Published in

Insider Engineering

·Jun 6, 2021

Spark 3 Reduced Our EMR Cost by 40%

One of the great reasons for upgrading to Spark 3 and EMR 6 Spark 3.0 has been released in June 2020 and arrived at the AWS EMR service with EMR 6.1 version in September 2020. After almost a year, we managed to upgrade our Spark version to 3.1.1 and…

Spark

4 min read

Spark 3 Reduced Our EMR Cost by 40%
Spark 3 Reduced Our EMR Cost by 40%
Spark

4 min read

Deniz Parmaksız

Deniz Parmaksız

589 Followers

Sr. Machine Learning Engineer at Insider | AWS Ambassador

Following
  • Darius Foroux

    Darius Foroux

  • Pinterest Engineering

    Pinterest Engineering

  • Sync

    Sync

  • Tabular

    Tabular

  • Jesus Rodriguez

    Jesus Rodriguez

See all (54)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech

Teams