Table of Contents

[hemmerling] Data Processing 5/8 - Business Intelligence, Data Mining

Related pages:

Organizations

Conferences, Seminars, Trainings

Germany

  1. Data Modeling Zone in Hannover ( 2013-09-23 - 2013-09-24 ).
  2. TDWI Young Guns.
      1. Full-day online training DatenPioniere GmbH "Microsoft Power BI Training - Wir machen Euch fit", 2023-02-03, 2023-03-03, 08:30-16:30. I joined :-).
        • “Kostenfreie Anmeldung. Werde mit uns zum Power BI Champion und melde Dich heute noch zur kostenfreien Basis Schulung an”.
        • “Jeden Monat bietet Dir Niklas ein kostenfreies Tagestraining an. Wir sind davon überzeugt, dass mehr gute Entscheidungen in deinem Unternehmen getroffen werden, wenn mehr Mitarbeitende Power BI beherrschen. Melde dich heute noch an”.
        • As of 2023-01:
          • Power BI Pro doesn't provide machine learning / artificial intelligence functions :-(.
          • Power BI Premium provides machine learning / artificial intelligence functions :-).
        • Starscheme: Separation of fact table and dimension table.
          • Fact table ( FACT_xxx, F_xxx ): German w-questions ( who, what, when, amount, rebate.. ), what is relevant for a financial transaction ( buy / sell / contract ).
          • Dimension table ( DIM_xxx, D_xxx ): Product ID, product category, country of customer.
          • If a customer buys 1000x the same good, you don't want all the 1000 data items in the dimension table.
        • Spreadsheet Cell vs. Calculated column vs. Measure.
          • In opposite to spreadsheets, BI tools are not based on the “cell” principle :-(, you can just address columns, not single cells :-(.
          • Calculated columns require filespace, Measures don't require filespace, as they are calculated in-memory.
          • Calculations
            Umsatz = 'F_Order Details'[Unit Price] * 'F_Order Details'[Quantity] *(1-'F_Order Details'[Discount])
            Einkaufskosten = 'F_Order Details'[Quantity] * RELATED(D_Products[UnitCost])
            Gewinn = 'F_Order Details'[Umsatz] - 'F_Order Details'[Einkaufskosten]
            
            Datumstabelle =
            VAR ErsteErfassung = Min(F_Orders[Order Date])
            VAR Kalender = CALENDAR(Date(Year(ErsteErfassung),1,1), Date(Year(Now())+1,12,31))
            return
            ADDCOLUMNS( Kalender,  
            "Jahr", Year([Date]),
            "JahrQuartal", FORMAT([Date], "yyyy-Q"),
            "JahrMonat", FORMAT([Date],"yyyy-mm"),
            "JahrKW", COMBINEVALUES("-", Year([Date]), FORMAT(WEEKNUM([Date],21),"00")),
            "Quartal", FORMAT([Date], "\QQ"),
            "Kalenderwoche", WEEKNUM([Date], 21),
            "Monat", FORMAT([Date], "mmm"),
            "Monat#", Month([Date]),
            "Wochentag", FORMAT([Date], "ddd"),
            "Wochentag#", Weekday([Date], 2)
            )
            
      • Important settings:
        • “File / Options and settings / Options” - “Global / Data Load”.
          • ”[_] Time intelligence”.
        • “File / Options and settings / Options” - “Current File / Data Load”.
          • ”[_] Time intelligence”.
          • ”[_] Import relationships from data soruces on first load”.
          • ”[_] Autodetect new relationships after data is loaded”.
      • Drilldown fields “Category Name, County aus Customers, Last Name aus Employes”.
      • After creating a Measure move it to a “Card” ( “123” ) visualisation!
        Umsatz Measure = Sum('F_Order Details'[Umsatz])
        !
        Umsatz Adams I =
        CALCULATE(
            Sum('F_Order Details'[Umsatz]),
            D_Employees[Last Name] = "Adams"
        )
        
        Umsatz Adams in Deutschland =
        Calculate(
            Sum('F_Order Details'[Umsatz]),
            D_Employees[Last Name] = "Adams",
            D_Customers[Country] = "Germany"
        )
        
        Umsatz Adams und Buchanan =
        CALCULATE(
            [Umsatz Measure],
            D_Employees[Last Name] = "Adams" ||
            D_Employees[Last Name] = "Buchanan"
        )
        
        Umsatz Adams II =
        Calculate(
            Sum('F_Order Details'[Umsatz]),
            D_Employees[Employee ID] = 4 //4 ist die ID für Adams
        )
        
        Gesamtumsatz =
        CALCULATE(
            [Umsatz Measure],
            All('F_Order Details')
            
        Anteil Umsatz = DIVIDE([Umsatz Measure],[Gesamtumsatz])
        
        Umsatz nach Shipped Date =
        Calculate(
            Sum('F_Order Details'[Umsatz]),
            USERELATIONSHIP(Datumstabelle[Date], F_Orders[Shipped Date])
        )
        
      1. The free event DatenPioniere "BI Arena auf Schalke" at Veltins Arena, Rudi-Assauer-Platz 1, 45891 Gelsenkirchen, 2025-01-29, 8:00 – 14:30.
        • I applied, but was rejected: “Leider sind die Plätze vor Ort rar, daher sind diese für Endkunden reserviert” :-(.
  3. “M365 Summits”.
  4. Online events by BI or DIE Self Service.
  5. Free conferences and expositions Tech Show Frankfurt / Big Data & AI World, Cloud Expo Europe Frankfurt / Data Centre World in Frankfurt, 2023-05-10 - 2023-05-11.

International

  1. DataTalks.Club - “Free Data Engineering course!” ( on demand ).
  2. Global Data Summit in USA ( 2017, 2019 ).
  3. The affordable commercial online event Power BI Summit, 2023-03-06 - 2023-03-10.
  4. The free live virtual event Open Source Data Summit ( OSDS ).
    • 2023-11-15.
    • 2024-??-??.

Software

Free ETL Software

Free Tools

Resources

Free BI Software

Free Data Warehouse Solutions

Free BI Tools

Free BI Frameworks

Data Cleanup

Machine Learning

Important Machine Learning Tools according to "Developer Economics - National Trends Survey", 2016-04
Tools
Resources

Microsoft BI Tools

The free standalone Microsoft Power BI

The Tool
Online Services
Downloads
Resources

Commercial Editions of Microsoft SQL Server with BI Support

Power BI Gateway - Personal

The Tool
Documentation
Concept
Resources

Free Addons for the commercial Microsoft Excel

Tools

Query and Data Modeling Languages

Azure Log Analytics Query Language
Analysis Services Scripting Language ( ASS )
Data Analysis Expressions ( DAX ) - Data Modeling Language
"M" Language ( "Power Query Formula Language" / Microsoft Power Query for Excel Formula Language" )
MultiDimensional eXpressions ( MDX )
Tabular Modeling Scripting Language ( TMSL )

Datazen

Data Science Platform by Joshua Görner

"TWDE Datalab" by ThoughtWorks

Some other commercial BI Software

Services

BigML

EdgeImpulse Inc.

Google

Google Colab ( Colaboratory )

Google Looker

Microsoft KUSTO

The Service

Azure Log Analytics Query Language

Resources

SAP HANA

The Service

Education

Events

Resources

Some other commercial BI Tools

Complex event processing, Streaming Analytics

Realtime Data Warehouse

FIWARE "Orion ( Context Broker )"

Google Cloud Stream Analytics Solution

Microsoft Technologies for Realtime Data Warehouses

Microsoft Azure Stream Analytics

Microsoft SQL Server - StreamInsight

Microsoft KUSTO

Resources

Data Analysis & Dashboards

Full Text Search

Artificial Intelligence

Webcasts. Webinars

Experts

Literature

Resources

Forums, Newsgroups


When this document changes ! Site Navigation ( My Business ! My Topics ! Imprint / Contact ! Privacy Policy ! Keyword Index ! ! Google+ Publisher "hemmerling" )