Mayur Rathi

⭐ 34.1k GitHub stars

Fabric-Lakehouse

Fabric-Lakehouse is an data AI skill with a core value of Use this skill to get context about Fabric Lakehouse and its features for software systems and AI-powered functions. It helps developers solve real-world problems in the data domain, boosting efficiency, automating repetitive tasks, and optimizing workflows.

Use this skill to get context about Fabric Lakehouse and its features for software systems and AI-powered functions. It offers descriptions of Lakehouse data components, organization with schemas and

Last verified on: 2026-07-14

Quick Facts

Category data

Works With Claude, GitHub Copilot

Source github/awesome-copilot

Stars ⭐ 34.1k

Last Verified 2026-07-14

Risk Level Low

mkdir -p ./skills/fabric-lakehouse && curl -sfL https://raw.githubusercontent.com/github/awesome-copilot/main/skills/fabric-lakehouse/SKILL.md -o ./skills/fabric-lakehouse/SKILL.md

Run in terminal / PowerShell. Requires curl (Unix) or PowerShell 5+ (Windows).

Skill Content

# When to Use This Skill

Use this skill when you need to:

- Generate a document or explanation that includes definition and context about Fabric Lakehouse and its capabilities.

- Design, build, and optimize Lakehouse solutions using best practices.

- Understand the core concepts and components of a Lakehouse in Microsoft Fabric.

- Learn how to manage tabular and non-tabular data within a Lakehouse.

# Fabric Lakehouse

Core Concepts

What is a Lakehouse?

Lakehouse in Microsoft Fabric is an item that gives users a place to store their tabular data (like tables) and non-tabular data (like files). It combines the flexibility of a data lake with the management capabilities of a data warehouse. It provides:

- **Unified storage** in OneLake for structured and unstructured data

- **Delta Lake format** for ACID transactions, versioning, and time travel

- **SQL analytics endpoint** for T-SQL queries

- **Semantic model** for Power BI integration

- Support for other table formats like CSV, Parquet

- Support for any file formats

- Tools for table optimization and data management

Key Components

- **Delta Tables**: Managed tables with ACID compliance and schema enforcement

- **Files**: Unstructured/semi-structured data in the Files section

- **SQL Endpoint**: Auto-generated read-only SQL interface for querying

- **Shortcuts**: Virtual links to external/internal data without copying

- **Fabric Materialized Views**: Pre-computed tables for fast query performance

Tabular data in a Lakehouse

Tabular data in a form of tables are stored under "Tables" folder. Main format for tables in Lakehouse is Delta. Lakehouse can store tabular data in other formats like CSV or Parquet, these formats are only available for Spark querying.

Tables can be internal, when data is stored under "Tables" folder, or external, when only reference to a table is stored under "Tables" folder but the data itself is stored in a referenced location. Tables are referenced through Shortcuts, which can be internal (pointing to another location in Fabric) or external (pointing to data stored outside of Fabric).

Schemas for tables in a Lakehouse

When creating a lakehouse, users can choose to enable schemas. Schemas are used to organize Lakehouse tables. Schemas are implemented as folders under the "Tables" folder and store tables inside of those folders. The default schema is "dbo" and it can't be deleted or renamed. All other schemas are optional and can be created, renamed, or deleted. Users can reference a schema located in another lakehouse using a Schema Shortcut, thereby referencing all tables in the destination schema with a single shortcut.

Files in a Lakehouse

Files are stored under "Files" folder. Users can create folders and subfolders to organize their files. Any file format can be stored in Lakehouse.

Fabric Materialized Views

Set of pre-computed tables that are automatically updated based on a schedule. They provide fast query performance for complex aggregations and joins. Materialized views are defined using PySpark or Spark SQL and stored in an associated Notebook.

Spark Views

Logical tables defined by a SQL query. They do not store data but provide a virtual layer for querying. Views are defined using Spark SQL and stored in Lakehouse next to Tables.

Security

Item access or control plane security

Users can have workspace roles (Admin, Member, Contributor, Viewer) that provide different levels of access to Lakehouse and its contents. Users can also get access permission using sharing capabilities of Lakehouse.

Data access or OneLake Security

For data access use OneLake security model, which is based on Microsoft Entra ID (formerly Azure Active Directory) and role-based access control (RBAC). Lakehouse data is stored in OneLake, so access to data is controlled through OneLake permissions. In addition to object-level permissions, Lakehouse also supports column-level and row-level security for tables, all

🎯 Best For

Claude users
GitHub Copilot users
Data professionals
Analytics teams
Researchers

💡 Use Cases

Data pipeline auditing
Query optimization

📖 How to Use This Skill

1
Install the Skill

Copy the install command from the Terminal tab and run it. The SKILL.md file downloads to your local skills directory.
2
Load into Your AI Assistant

Open Claude or GitHub Copilot and reference the skill. Paste the SKILL.md content or use the system prompt tab.
3
Apply Fabric-Lakehouse to Your Work

Provide context for your task — paste source material, describe your audience, or share existing work to guide the AI.
4
Review and Refine

Edit the AI output for accuracy, tone, and completeness. Add human insight where the AI lacks context.

❓ Frequently Asked Questions

How do I install Fabric-Lakehouse?

Copy the install command from the Terminal tab and run it. The skill downloads to ./skills/fabric-lakehouse/SKILL.md, ready to use.

Can I customize this skill for my team?

Absolutely. Edit the SKILL.md file to add team-specific instructions, examples, or workflows.

⚠️ Common Mistakes to Avoid

Ignoring data quality

AI analysis inherits all data quality issues — profile your data first.

🔗 Related Skills

acreadiness-assess Acreadiness-Assess acreadiness-policy Acreadiness-Policy adr-generator ADR Generator ai-prompt-engineering-safety-best-practices Ai-Prompt-Engineering-Safety-Best-Practices ai-readiness-reporter Ai-Readiness-Reporter ai-ready Ai-Ready