Generic filters
Exact matches only
Search in title
Search in content
Search in excerpt

Data Science Overview | Tools, Tech & Modern Roles in the Data Driven Enterprise (TTDS6000)

The Data Science & Big Data Overview | Tools, Tech & Modern Roles in the Data-Driven Enterprise is an introductory level course that introduces the entire multi-disciplinary Data Science team to the many evolving and related terms, with focus on Big Data, Data Science, Predictive Analytics, Artificial Intelligence, Data Mining, Data Warehousing. The overview explores the current state of the art and science, the major components of a modern data science infrastructure, team roles and responsibilities, and level-setting realistic possible outcomes for your investment.   This goal of this course is to provide students with a baseline understanding of core concepts and technologies to a conversant level.

What You’ll Learn

This course provides a high-level view of a variety of core, current data science related technologies, strategies, skillsets, initiatives and supporting tools in common business enterprise practices.  This list covers a general range of topics current to the time of course distribution. We will collaborate with your team to refine level of depth of coverage, understand areas of greater importance to your team, where you would like to add demos, etc.

  • Price: $895.00
  • Duration: 1 day
  • Delivery Methods: Virtual
Date Time Price Option
10/24/2024 09:00 AM - 05:00 PM CT $895.00
12/05/2024 10:00 AM - 06:00 PM CT $895.00
01/31/2025 10:00 AM - 06:00 PM CT $895.00
03/14/2025 09:00 AM - 05:00 PM CT $895.00
05/16/2025 09:00 AM - 05:00 PM CT $895.00
06/18/2025 09:00 AM - 05:00 PM CT $895.00
08/01/2025 09:00 AM - 05:00 PM CT $895.00
09/12/2025 09:00 AM - 05:00 PM CT $895.00
10/24/2025 09:00 AM - 05:00 PM CT $895.00
12/05/2025 10:00 AM - 06:00 PM CT $895.00
For questions call: (469) 721-6100

Why choose
TOPTALENT?

  • Get assistance every step of the way from our Texas-based team, ensuring your training experience is hassle-free and aligned with your goals.
  • Access an expansive range of over 3,000 training courses with a strong focus on Information Technology, Business Applications, and Leadership Development.
  • Have confidence in an exceptional 95% approval rating from our students, reflecting outstanding satisfaction with our course content, program support, and overall customer service.
  • Benefit from being taught by Professionally Certified Instructors with expertise in their fields and a strong commitment to making sure you learn and succeed.

Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We will work with you to tune this course and level of coverage to target the skills you need most.

Foundations

  • Grids and Virtualization
    • Service-Oriented Architecture
    • Enterprise Service Bus
  • Enterprise Message Bus
  • The Cloud

The Hadoop Ecosystem

  • HDFS: Hadoop Distributed File System
  • Resource Negotiators: YARN, Mesos, and Spark; ZooKeeper
  • Hadoop Map/Reduce
  • Spark
  • Hadoop Ecosystem Distributions: Cloudera, Hortonworks, OpenSource

Big Data, NOSQL, and ETL

  • Big Data vs. RDBMS
  • NOSQL: Not Only SQL
  • Relational Databases: Oracle, MariaDB, DB/2, SQL Server, PostGreSQL
  • Key/Value Databases: JBoss Infinispan, Terracotta, Dynamo, Voldemort
  • Columnar Databases: Cassandra, HBase, BigTable
  • Document Databases: MongoDB, CouchDB/CouchBase
  • Graph Databases: Giraph, Neo4J, GraphX
  • Apache Hive
  • Common Data Formats
  • Leveraging SQL and SQL variants

ETL: Exchange, Transform, Load

  • Data Ingestion, Transformation, and Loading
  • Exporting Data
  • Sqoop, Flume, Informatica, and other tools

Enterprise Integration Patterns and Message Busses

  • Enterprise Integration Patterns: Apache Camel and Spring Integration
  • Enterprise Message Busses: Apache Kafka, ActiveMQ, and other tools

An Overview of Developing in Hadoop Ecosystem

  • Languages: R, Python, Java, Scala, Pig, and BPMN
  • Libraries and Frameworks
  • Development, Testing, and Deployment

Exploring Artificial Intelligence and Business Systems

  • Artificial Intelligence: Myths, Legends, and Reality
    • The Math
    • Statistics
    • Probability
  • Clustering Algorithms, Mahout, MLLib, SciKit, and Madlib
  • Business Rule Systems: Drools, JRules, Pegasus

The Modern Data Team

  • Agile Data Science
  • NOSQL Data Architects and Administrators
  • Developers
  • Grid Administrators
  • Business and Data Analysts
  • Management
  • Evolving your Team
  • Growing your Infrastructure

This course provides a high-level view of a variety of core, current data science related technologies, strategies, skillsets, initiatives and supporting tools in common business enterprise practices.  This list covers a general range of topics current to the time of course distribution. We will collaborate with your team to refine level of depth of coverage, understand areas of greater importance to your team, where you would like to add demos, etc.

Throughout the session you’ll:

  • Foundations: Grids & Virtualization; SOA, ESB / EMB, The Cloud
  • The Hadoop Ecosystem: HDFS; Resource Navigators, MapReduce, Spark, Distributions
  • Big Data, NOSQL, and ETL
  • ETL: Exchange, Transform, Load
  • Handling Data & a Survey of Useful tools
  • Enterprise Integration Patterns and Message Busses
  • Developing in Hadoop Ecosystem: R, Python, Java, Scala, Pig, and BPMN
  • Artificial Intelligence and Business Systems
  • Who’s on the Team? Evolving Roles and Functions in Data Science
  • Growing your Infrastructure

Need different skills or topics?  If your team requires different topics or tools, additional skills or custom approach, this course may be further adjusted to accommodate.  We offer additional Big Data / Data Science, Hadoop, programming, analytics, Python/R, and other related topics that may be blended with this course for a track that best suits your needs. Our team will collaborate with you to understand your needs and will target the course to focus on your specific learning objectives and goals.

This introductory-level / primer course is an overview intended for Business Analysts, Data Analysts, Data Architects, DBAs, Network (Grid) Administrators, Developers or anyone else in the data science realm who need to have a baseline understanding of some of the core areas of modern Data Science technologies, practices and available tools.

Attendees should have prior exposure to Enterprise Information Technology. As well as familiarity with Relational Databases.

Attendees should have prior exposure to Enterprise Information Technology. As well as familiarity with Relational Databases.

Ten (10) business days’ notice is required to reschedule a class with no additional fees. Notify TOPTALENT LEARNING as soon as possible at 469-721-6100 or by written notification to info@toptalentlearning.com to avoid rescheduling penalties.
Please contact our team at 469-721-6100; we will gladly guide you through the online purchasing process.
You will receive a receipt and an enrollment confirmation sent to the email you submitted at purchase. Your enrollment email will have instructions on how to access the class. Any additional questions our team is here to support you. Please call us at 469-721-6100.
If a student is 15 minutes late, they risk losing their seat to a standby student. If a student is 30 minutes late or more, they will need to reschedule. A no-show fee will apply. Retakes are enrolled on a stand-by basis. The student must supply previously issued courseware. Additional fees may apply.
You will receive a ‘Certificate of Completion’ once you complete the class. If you purchased an exam voucher for the class, a team member from TOPTALENT LEARNING will reach out to discuss your readiness for the voucher and make arrangements to send it.