Thoughts on Design,
Technology & Beyond

Exploring the intersection of creativity and innovation through
in-depth articles, case studies, and industry insights.

50+ Articles
10K+ Readers
5 Categories

Welcome to the TensorBlue AI Development Blog

Your comprehensive resource for cutting-edge insights on artificial intelligence, machine learning, and software development. Our blog features in-depth articles, practical tutorials, industry case studies, and expert analysis to help you stay ahead in the rapidly evolving world of AI technology.

Whether you're a developer looking to implement LLM fine-tuning, a business leader exploring AI transformation strategies, or a data scientist seeking advanced MLOps practices, you'll find actionable content tailored to your needs. We cover everything from foundational concepts to production deployment, ensuring you have the knowledge to build scalable, reliable AI systems.

AI & Innovation

Explore the latest breakthroughs in artificial intelligence, including large language models, generative AI, computer vision, and neural network architectures. Learn about cutting-edge research and practical applications.

Development Guides

Step-by-step tutorials on implementing AI solutions, from data preparation to production deployment. Comprehensive guides on MLOps, model optimization, and best practices for building enterprise-grade AI systems.

Industry Insights

Real-world case studies and industry-specific AI implementations across healthcare, finance, retail, manufacturing, and more. Learn from successful deployments and understand ROI considerations.

Our expert team of AI engineers, data scientists, and ML practitioners shares knowledge gained from building production AI systems for Fortune 500 companies and innovative startups. Each article is crafted to provide both theoretical understanding and practical implementation guidance, complete with code examples, architecture diagrams, and deployment strategies.

All articles

100 Articles

AI & Innovation
8 min read

AI Is Not Your Accessibility Expert: What LLMs Still Miss About WCAG
By ashokyadav1231

Artificial intelligence is rapidly reshaping the software development landscape. AI-assisted coding tools and large language models (LLMs) are increasingly being integrated into everyday engineering workflows, helping developers accelerate

AI & Innovation
15 min read

Payment Orchestration Platforms in 2026: The Developer’s Guide to a More Flexible Payment Stack
By SitePoint Sponsors

Payments used to be simpler from a developer's point of view. You connected a gateway, sent a transaction, waited for the response, and handled the result. That model does not scale very well in 2026. Modern online busine

AI & Innovation
15 min read

Claude Code Plan Mode: The Read-First Workflow for Complex Refactors
By SitePoint Team

Refactoring a large codebase with an AI coding assistant sounds straightforward until it isn't. Claude Code's Plan Mode addresses these failures directly by enforcing a read-first workflow that separates the reasoning pha

AI & Innovation
14 min read

How to Route DeepSeek-V4 Through Claude Code for Local Agentic Coding
By SitePoint Team

How to Route DeepSeek-V3 Through Claude Code 1. **Verify** that DeepSeek's API endpoint supports Anthropic Messages API schema (or plan to use a translation proxy like LiteLLM). 2. **Install** Node.js v18+ and the Claude

AI & Innovation
16 min read

Which DeepSeek Model Fits Your Hardware? VRAM Sizing Guide for 2026
By SitePoint Team

Running DeepSeek models locally in 2026 offers cost savings and data privacy, but GPU VRAM is the single constraint that determines whether a model runs, crawls, or crashes outright. This guide provides a concrete sizing

Technology
5 min read

Why Function Calling Is More Important Than Prompt Engineering
By Christian chimeremeze ezenwa

For the past few years, prompt engineering has been one of the most discussed topics in artificial intelligence. Developers shared prompt templates. Companies hired prompt engineers. Social media became filled with examples of carefully cra

AI & Innovation
4 min read

Why Building AI Products Is More Than Connecting an API

When I first started exploring artificial intelligence, I assumed building an AI product was relatively straightforward. The process seemed simple. Connect an AI model to an application, send prompts, receive responses, and display the resu

Business
10 min read

Risk First Compliance

Transitioning from a "Compliance-First" approach to a "Risk-First" mindset recognizes that compliance should not be viewed in isolation, but as a component of a broader risk management strategy.

AI & Innovation
12 min read

Staff Engineers Impact Incidents

Staff engineers impact incidents by modeling transparent and productive behavior, serving as incident commanders to coordinate response, and getting involved in retrospectives to address root cultural issues.

Technology
18 min read

Architecting Java Persistence Patterns And Strategies

Discover Java persistence patterns: Driver, Mapper, DAO, Active Record, Repository. Balance layers and optimize data flow.

Technology
12 min read

Incident Lifecycle Resilience

Build organizational resilience to incidents through improved coordination and communication, blameless reviews, root cause analysis, and insightful communication to enable meaningful change.

Technology
10 min read

Orchestrating Resilience Modern Asynchronous Systems

In this article, we will discuss what problems we had to solve at Twilio to efficiently build a resilient and scalable asynchronous system and the advantages we got adopting Workflow Orchestration.

Technology
12 min read

Breaking Changes Are Broken Semver

In this article, we address the most contentious parts of the SemVer standard to understand how you can trade off backward compatibility and upgradability with modernization and iterability.

AI & Innovation
15 min read

Large Language Models (LLMs) Prompting

In this article, authors Numa Dhamani and Maggie Engler discuss how prompt engineering techniques can help use the large language models (LLMs) more effectively to achieve better results.

Technology
14 min read

Cloud Computing Post Serverless Trends

Discover the evolution of cloud-computing in the post-serverless era, with a shift towards hyper-specialized vertical services and a trend from Infrastructure as Code to Composition as Code.

AI & Innovation
10 min read

AI Organizational Resilience

Organizations should empower staff to determine where generative AI makes sense, while building literacy on capabilities and limits. A human-centric, iterative approach will produce the best outcomes.

Technology
16 min read

MVA Enough Architecture

The Minimum Viable Architecture (MVA) is the architectural complement to a Minimum Viable Product (MVP). The MVA and MVP must evolve together for a product to be successful.

AI & Innovation
15 min read

Product Leadership Solution Focused

This article explores how and which parts of coaching and nuanced language can help you leverage your interactions to yield better results in product management using a solution-focused approach.

AI & Innovation
13 min read

Understanding Architectures Multiregion Data Residency

The main focus of this article is the effective implementation of data residency strategies while ensuring a positive experience for all stakeholders.

Technology
9 min read

Idempotence AWS Serverless Architecture

Understand idempotence in AWS serverless setups, tackling challenges from at-least-once delivery. Learn to implement and automate idempotence in AWS Lambda functions for reliability.

Technology
9 min read

Platform SRE Evolving DevOps

As DevOps has evolved from nice to have to must have, organizations need to evolve their practices using site reliability and platform engineering. Getting the balance right is hard and necessary.

Technology
20 min read

Spring Boot 3.2 Spring 6.1

Spring Framework 6.1 and Spring Boot 3.2 run on Java 21, make concurrent programming simpler and more efficient with virtual threads, and initially support “Scale to Zero” startup time with CRaC.

Technology
10 min read

Productivity Constant Change

To maximize engineering productivity during constant change, leaders can support their teams by learning how to use some leadership frameworks to adjust based on the context and situation.

AI & Innovation
11 min read

Generative AI Fraud Prevention

The article examines how generative AI impacts fraud detection by reducing false positives and adapting to evolving fraud patterns, offering a potent solution when combined with machine learning.

Technology
13 min read

Achieve Flow Improve Quality

There’s always more to do than can possibly get done, so it's important for work to flow effectively. This article discusses 4 steps to achieving operational flow and improving quality in tech teams.

Technology
14 min read

What's New PHP 8.3

Besides performance improvements, PHP 8.3 brings many new features, including amendments to the existing readonly feature, explicitly typed class constants, a new #[\Override] attribute, and more.

AI & Innovation
10 min read

Enabling Developer Creativity

As an engineering manager, it is your responsibility to help foster creative thinking skills among the development team. This article provides concrete advice on ways to encourage creativity.

AI & Innovation
10 min read

Managed Relational Databases Costs

The rising popularity of managed relational databases brings hidden costs. This article shows the importance of monitoring service expenses and understanding operational constraints.

Technology
12 min read

Testing Machine Learning Simulators

Testing machine learning systems is different. Machine Learning applications consist of a few lines of code, with complex networks of weighted data points. The data is where you find issues and bugs.

Technology
10 min read

Agile Lean Architecture

When it comes to software architecture, should you adopt an agile or a lean approach? The answer, of course, is "it depends," as each approach is best suited to different circumstances.

AI & Innovation
8 min read

Rust Procedural Macros Replace Panic

We show how to write advanced macros to step through Rust code and modify it using the standard tooling available in the syn crate and the Fold trait to recursively step through the entire function.

Technology
12 min read

Technical Decision Buy In

This article examines how Comcast has employed the Analytic Hierarchy Process (AHP), a decision-making framework, and adapted it for making technical and non-technical decisions both large and small.

Technology
10 min read

AWS Lambda Under The Hood

The AWS Lambda Under the Hood article starts with an introduction to Lambda itself to outline the key concepts of the service and its fundamentals, followed by a deep dive into understanding the system.

Technology
14 min read

Zero Knowledge Proofs Intro

This article presents zero-knowledge proofs, a kind of cryptography used to prove knowledge of a secret, such as a private key or the solution to a problem, without sharing it with interested parties.

Technology
33 min read

Leading Team Software Engineer

In this virtual panel, we explore what made people decide to become a leader and how they did it, and we'll find out if we really have to leave tech forever or if there's a way back into engineering.

AI & Innovation
14 min read

Cloudflare Distributed Postgres

Discover how Cloudflare leverages distributed PostgreSQL clusters at the edge, tackling challenges like replication lag. The cross-region architecture ensures resilience and quick failovers.

Design
21 min read

High Availability In The Cloud With Cellular Architecture

Cellular architecture is a design pattern that helps achieve high availability in multi-tenant applications.

Technology
10 min read

Managing Memberships Netflix

In this article, Diwan shares how the Netflix membership team does distributed systems: the architecture bets, technology choices, and operational semantics.

AI & Innovation
19 min read

Pinterest Ad Ranking AI

Aayush Mudgal of Pinterest presented a session at QCon San Francisco 2023 on Unpacking how Ad Ranking Works at Pinterest, showing how Pinterest uses deep learning for targeting advertisements.

Technology
9 min read

Getting Tech Debt On Roadmap

At QCon San Francisco 2023, Ben Hartshorne talked about integrating technical debt resolutions into a roadmap. It is essential to articulate tech-debt value beyond just calling them technical fixes.

Technology
15 min read

Natural Language Interface Application

In this article, author Ashley Davis discusses how to add a natural language interface to a chatbot application and how to extend the chatbot by adding voice commands.

Technology
8 min read

Architecture Trends 2024

The InfoQ Trends Reports offer InfoQ readers a comprehensive overview of key topics worthy of attention. Our accompanying podcast features discussions digging deeper into some of the trends.

AI & Innovation
7 min read

InfoQ Articles Contest 2024

InfoQ encourages software practitioners and domain experts to submit full-length technical educational articles.

Technology
8 min read

Culture Trends 2024

The Culture & Methods Trends in 2024 cover the value of staff plus engineers, DevEx metrics, ways to make remote teams effective, challenges with diversity, and the impact of software development on climate.

AI & Innovation
14 min read

Jakarta EE Testing Deep Dive

The article advocates using modern libraries and Testcontainers to facilitate data-driven testing in Java for robust Jakarta Data and Jakarta NoSQL applications.

Technology
12 min read

Multi Cloud Observability Fluent Bit

Discover how Fluent Bit, a lightweight tool for collecting and distributing logs, enhances multi-cloud observability, reducing egress costs, and addressing compliance challenges.

Technology
15 min read

Brittle Test Suite Maybe Too Dry

When DRY is applied to test code, it can cause the tests to become brittle. In this article, I will present guidelines to follow when reducing duplication in tests, and better ways to DRY up tests.

Technology
11 min read

How To Use Multiple GitHub Accounts

In this article, we show what Git provides for account configuration, its limitations, and the solution to switch accounts automatically based on a project parent directory location.

Technology
18 min read

Nine Steps Agile Architecture

Just as a Minimum-Viable Architecture (MVA) approach does not create a system’s architecture in a single step, adopting an MVA approach takes a series of incremental steps as well.

Design
10 min read

No Need CSS Framework

Many developers use CSS frameworks to reduce boilerplate, increase quality, and drive consistency. This sounds good in theory but often fails in practice. Write custom CSS instead.

AI & Innovation
12 min read

WebAssembly Component Model

WebAssembly evolves beyond browsers, fostering a polyglot environment where languages like Rust, Python, and JavaScript can interoperate seamlessly using the WebAssembly Component Model (WCM).

Technology
13 min read

AWS Lambda Cold Starts Myths

This insightful InfoQ article dispels the common myths surrounding Lambda Cold Starts, a widely discussed topic in the serverless computing community.

Technology
30 min read

Virtual Panel High Performing Software Teams

In this virtual panel, we'll discuss how engineering managers support teams, what skills they possess, and how they establish alignment and foster knowledge and experience sharing between teams.

Technology
15 min read

Secure Resilient Delivery Pipelines

Your CI/CD pipeline can potentially expose sensitive information. Project teams often overlook the importance of securing their pipelines.

Technology
20 min read

Netflix Highly Reliable Stateful Systems

Building reliable stateful services at scale isn’t a matter of building reliability into the servers, the clients, or the APIs in isolation.

Technology
15 min read

Three As Building Successful Platforms

In this article, I will share key lessons I have learned while building and delivering three platforms over the last two decades, including where we got stuck and how we unblocked ourselves.

AI & Innovation
13 min read

LLM Productivity Experiment

This article describes an experiment that sought to determine if no-cost LLM-based code generation tools can improve developer productivity.

Business
10 min read

Accelerating Technical Decision Making

Carta harnesses the power of a small group of senior engineers called navigators to bridge the gap between global strategy and local decision-making, using a written engineering strategy.

AI & Innovation
16 min read

RIG Data Consistent Microservices

The RIG model formulates three rules for a saga call chain. A gamified RIG tool can be used by teams to model a microservice system that guarantees eventual data consistency.

Technology
14 min read

Trade Offs Minimizing Unhappiness

To architect is to be a frustrated perfectionist; a good architecture minimizes this unhappiness by making trade-offs that can be lived with.

Technology
20 min read

Cost Optimization Engineering Perspective

A single line of code can shape an organization's financial future. Erik Peterson, the CTO and founder at CloudZero, presented an engineering perspective on cloud cost optimization at QCon SF.

Case Studies
13 min read

Config Maps With Spring Boot

Spring Boot is a framework known for its agility and workflow. Yet configuration remains a factor in deployment and maintenance. ConfigMaps provide configuration strategies for Spring Boot applications.

AI & Innovation
9 min read

Html Streaming Dom Updates Without Javascript

Web apps provide the best experience when they load quickly and data appears as available. We review how to use streaming HTML to load pages quickly and display data asynchronously without JavaScript.

Technology
23 min read

Virtual Panel Developer Experience Platform Engineering

In this virtual panel, we’ll discuss how teams build platforms, set others up for success, work with developers who use their platform, measure their progress, and adapt to new challenges.

Technology
13 min read

Kubernetes Logging Enhanced Observability

In this article, we will explore the challenges, strategies, and best practices that will help you achieve seamless log management in your Kubernetes environment.

Technology
16 min read

Benchmark Profile eBPF Code

In this article, we will walk through creating a basic eBPF program in Rust. This simple and approachable eBPF program will intentionally include a performance regression.

AI & Innovation
9 min read

Generative AI Software Project Management

Gen AI Assistants play to the strengths of professionals with a breadth of experience like software developers who can describe what they want the LLM to complete and critically evaluate the result.

Technology
9 min read

GreenOps Operational Efficiency

Our infrastructures have environmental and economic costs; the IT sector is responsible for 1.4% of carbon emissions worldwide. GreenOps can be used to help mitigate this impact.

Design
14 min read

Human Involvement Interface Design

Good interface design is a complex engineering challenge with many dimensions. This article explores the key dimensions of Ownership and whether a Human is involved.

AI & Innovation
16 min read

Platform Runtime Engineering

We need to take the concepts of platform engineering to the code level, reduce cognitive load, help simplify and accelerate software development, and allow for easy maintenance and platform upgrades.

Business
14 min read

Set Piece Strategy Sheen Brisals

In this article, AWS Serverless Hero Sheen Brisals examines how the characteristics of serverless influence us to think in a new way of architecting and evolving modern applications as set pieces.

AI & Innovation
9 min read

AI Revolution Not Monopolized

Open-source initiatives are vital for democratizing AI technology, providing transparent and extensible tools. The community rapidly turns research into practical AI tools, enhancing their utility.

Technology
14 min read

Architect Software For Greener Future

In this article, Sara Bergman will share tips, tricks, and advice on architecting software for a greener future.

Case Studies
19 min read

Java Virtual Threads A Case Study

This article explores JDK 21's virtual threads, comparing their performance with Open Liberty's thread pool and highlighting key findings and performance issues.

Technology
11 min read

Mastering Long Running Processes

Bernd Ruecker's QCon London 2024 talk highlighted the significance of long-running processes, asynchronous communication, and visual tools like BPMN for improving communication in distributed systems.

AI & Innovation
11 min read

Scaling Challenges

The main objective of this article is to uncover the valuable lessons learned and insights gained from Trainline's journey through the dynamic landscape of digital transportation platforms.

AI & Innovation
12 min read

Thinking Like Architect

Are architects supposed to be the smartest people on the team? Certainly not. Rather, architects make everyone else smarter, for example by sharing decision models or revealing blind spots.

Technology
9 min read

Architectural Retrospectives

The purpose of an architectural retrospective is to use experience to help the development team improve their architecting skills and their way of working as they make architectural decisions.

AI & Innovation
13 min read

Quality Champions Software

Even skilled and motivated agile teams sometimes fail to achieve their own software quality goals. This article presents a practice to assist agile teams in reaching their quality goals.

Case Studies
12 min read

Uber Migration Hybrid Cloud

Uber operates a complex real-time fulfillment system. This article discusses migrating this workload from on-premises to a hybrid cloud architecture with no downtime or business impact.

Technology
13 min read

Adapt Surprises Software Reliant Businesses

This article explores understanding what makes incidents so rare (when and how they do not happen) and so minor (over how much worse they can be) and deliberately enhancing what makes that possible.

Technology
12 min read

SQLite Java Integration WebAssembly

JVM apps often need to run native code. The current options, porting to the JVM or dynamic linking, have significant drawbacks. Using the Chicory Wasm runtime promises a safer alternative.

Technology
11 min read

Cloud Waste Management

The 2024 "State of FinOps" survey results of the FinOps Foundation mentioned that organizations' top priorities have shifted to reducing cloud waste or unused resources.

Technology
12 min read

Shipping Threads 5 Months

In Jan 2023, we received word that we’d need to build a microblogging service. This article describes how we developed and launched the Threads app at Meta last year.

AI & Innovation
11 min read

Efficient DevSecOps Workflows

Michael Friedrich explores DevSecOps inefficiencies, highlighting issues like debugging delays. He also showcases AI's potential to improve workflow efficiency.

AI & Innovation
11 min read

AI ML Data Engineering Trends 2024

InfoQ editorial staff and friends of InfoQ are discussing the current trends in the domain of AI, ML and Data Engineering as part of the process of creating our annual trends report.

AI & Innovation
9 min read

WebAssembly Containers .NET Aspire

In this article, we will dive into .NET Aspire and illustrate how you can orchestrate next-generation distributed applications that consist of containers, WebAssembly workloads, and dependencies.

AI & Innovation
12 min read

Technical Debt Your Friend

The technical debt metaphor is misleading because much of the so-called debt never needs to be repaid. This conclusion is apparent when using the Minimum Viable Architecture (MVA) approach.

Technology
15 min read

Analysis Optimization Change Release Process

The recent CrowdStrike outage highlights the need to uphold best practices in production changes and offers a chance to reevaluate processes for managing complex systems effectively.

Case Studies
12 min read

Llama 3 Deployment Applications

Learn about the capabilities of the open-source Llama 3 LLM, how to deploy it in the cloud or on-premise, and how to leverage fine-tuned versions for specific tasks.

AI & Innovation
4 min read

Practical Applications Generative AI Series

In the InfoQ "Practical Applications of Generative AI" article series, we present real-world solutions and hands-on practices from leading GenAI practitioners in the industry.

AI & Innovation
16 min read

Reactive Java Vert.x Deep Dive

This article discusses non-blocking I/O models in software development, focusing on Vert.x for building reactive applications on the JVM, with superior performance in high-concurrency environments.

Technology
15 min read

Platform Engineering Developer Experience

This article is about curating a developer experience; it shares experiences and learnings from implementing DevEx and ideas on what platform engineers can do for development teams that use platforms.

AI & Innovation
10 min read

Building Better Platforms With Empathy

At QCon San Francisco 2023, David Stenglein explored the shift to a product model for internal platforms and how it benefited from people-centric tools like customer empathy and the DevEx framework.

Case Studies
9 min read

Navigating LLM Deployment

Learn how to get the best performance from self-hosted LLMs, with best practices on how to overcome challenges due to model size, GPU scarcity, and a rapidly evolving field.

AI & Innovation
15 min read

Minimize Latency Cost Distributed Systems

Explore the benefits and challenges of microservices architecture in cloud environments, focusing on achieving resilience and high availability while managing costs and performance issues.

AI & Innovation
29 min read

LLM Adoption Considerations

Four experts discuss some issues people should think about when adopting LLMs and how they can make the best choice for their specific use case.

AI & Innovation
19 min read

Functional Programming Web App

Functional programming might seem intimidating and overly academic at first, but once you get the hang of it, it helps you build more maintainable software, making the process more enjoyable.

AI & Innovation
11 min read

Securing Linux Applications

Maintaining a strong security posture is challenging, especially with Linux. An effective approach is proactive and includes patch management, optimized resource allocation, and effective alerting.

AI & Innovation
13 min read

Netflix Global Cache

Netflix's EVCache system powers 400M ops/second with 14.3 PB of data, optimizing global availability, scalability, and efficiency while reducing costs through intelligent data routing and compression.
