read our blogs

Blogs or Expert Columns

Theano

Overview Theano is a Python library for fast numerical computation that can be run on the CPU or GPU. It is a key foundational library for Deep Learning in Python that you can use directly to create Deep Learning models or wrapper libraries that greatly simplify the process. In this post, you will discover the Theano Python library. What is Theano? Theano is an open source project released under the BSD license and was developed by the LISA (now MILA) group at the University of Montreal, Quebec, Canada (home of Yoshua Bengio). It is named after a Greek mathematician. At its heart, [...]

Caffe 

Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by Berkeley AI Research (BAIR) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license. Why Caffe? Expressive architecture encourages application and innovation. Models and optimization are defined by configuration without hard-coding. Switch between CPU and GPU by setting a single flag to train on a GPU machine then deploy to commodity clusters or mobile devices. Extensible code fosters active development. In Caffe’s first year, it has been forked by over [...]

Teradata

What is Teradata? Teradata is an RDBMS (relational database management system) which includes features: It’s built on completely parallel architecture which means single task will be divided into smaller chunks and compute simultaneously hence faster execution. Teradata system is a shared-nothing architecture in which each node is independent and self-sufficient. Also, each logical processor (AMP) is responsible only for their own portion of the database. Supporting industry standard ANSI SQL to communicate with Teradata. Teradata database can be accessed by multiple concurrent users from different client applications via popular TCP/IP connection or IBM mainframe channel connection. Why use Teradata? [...]

Spark ML

Spark ML Spark.ml is a new package introduced in Spark 1.2, which aims to provide a uniform set of high-level APIs that help users create and tune practical machine learning pipelines. It is currently an alpha component, and we would like to hear back from the community about how it fits real-world use cases and how it could be improved.Note that we will keep supporting and adding features to spark.mllib along with the development of spark.ml. Users should be comfortable using the spark.mllib features and expect more features coming. Developers should contribute new algorithms to spark.mllib and can optionally [...]

Java

Java is a programming language and a computing platform for application development. It was first released by Sun Microsystem in 1995 and later acquired by Oracle Corporation. It is one of the most used programming languages. Oracle has two products that implement Java Platform Standard Edition (Java SE) 8: Java SE Development Kit (JDK) 8 and Java SE Runtime Environment (JRE) 8. JDK 8 is a superset of JRE 8 and contains everything that is in JRE 8, plus tools such as the compilers and debuggers necessary for developing applets and applications. JRE 8 provides the libraries, the Java [...]

ETL – Extract Transform Load

Overview One of the ways to integrate with external systems is using the Extract, Transform, Load (ETL) system. The ETL system creates databases which can be accessed directly by 3rd party tools and solutions. It also allows scheduled execution of transformation scripts on the Data Center Operation server. Together this gives the ability to extract and enter data on the Data Center Operation server. Based on the ETL system, it is possible to develop custom solutions, integrating DCO with a broad range of data sources. ETL can be used in 2 ways: Importing Data in to and Exporting Data from Data Center Operation External system integration using ETL ETL modeling concepts The general framework for ETL [...]

Multi-tenancy

Multi-tenancy is an architecture in which a single instance of a software application serves multiple customers. Each customer is called a tenant. Tenants may be given the ability to customize some parts of the application, such as the color of the user interface (UI) or business rules, but they cannot customize the application's code. Multi-tenancy can be economical because software development and maintenance costs are shared. It can be contrasted with single-tenancy, an architecture in which each customer has their own software instance and may be given access to the code. With a multi-tenancy architecture, the provider only has [...]

Azure HDInsight Spark

Azure HDInsight Spark Overview Introduction to Spark on HDInsight This article provides you with an introduction to Spark on HDInsight. Apache Spark is an open-source parallel processing framework that supports in-memory processing to boost the performance of big-data analytic applications. Spark cluster on HDInsight is compatible with Azure Storage (WASB) as well as Azure Data Lake Store. Hence, your existing data stored in Azure can easily be processed via a Spark cluster. When you create a Spark cluster on HDInsight, you create Azure compute resources with Spark installed and configured. It only takes about 10 minutes to create a Spark cluster [...]

Compression 

Compression is a simple, effective way to save bandwidth and speed up your site. It’s the 21st century. Most of the traffic comes from modern browsers, and quite frankly, most of the users are fairly tech-savvy. No one wants to slow someone else down because somebody is chugging along on IE 4.0 on Windows 95. Google and Yahoo use gzip compression. A modern browser is needed to enjoy modern web content and modern web speed — so gzip encoding it is. Here’s how to set it up. Before we start we should explain what content encoding is. When you request a [...]

Amazon Rekognition

What Is Amazon Rekognition? Amazon Rekognition makes it easy to add image and video analysis to your applications. You just provide an image or video to the Rekognition API, and the service can identify objects, people, text, scenes, and activities. It can detect any inappropriate content as well. Amazon Rekognition also provides highly accurate facial analysis and facial recognition. You can detect, analyze, and compare faces for a wide variety of use cases, including user verification, cataloging, people counting, and public safety. Amazon Rekognition is based on the same proven, highly scalable, deep learning technology developed by Amazon’s computer [...]

Amazon S3( Simple Storage Service)

What Is Amazon S3? Amazon Simple Storage Service is storage for the Internet. It is designed to make web-scale computing easier for developers. Amazon S3 has a simple web services interface that you can use to store and retrieve any amount of data, at any time, from anywhere on the web. It gives any developer access to the same highly scalable, reliable, fast, inexpensive data storage infrastructure that Amazon uses to run its own global network of web sites. The service aims to maximize benefits of scale and to pass those benefits on to developers. This guide explains the [...]

Apache Sentry

What is Apache Sentry? Apache Sentry is a granular, role-based authorization module for Hadoop. Sentry provides the ability to control and enforce precise levels of privileges on data for authenticated users and applications on a Hadoop cluster. Sentry currently works out of the box with Apache Hive, Hive Metastore/HCatalog, Apache Solr, Impala and HDFS (limited to Hive table data). Sentry is designed to be a pluggable authorization engine for Hadoop components. It allows you to define authorization rules to validate a user or application’s access requests for Hadoop resources. Sentry is highly modular and can support authorization for a [...]

Amazon AWS

What Is Amazon AWS? Amazon AWS is the implementation of Amazon Web Services in the Beijing and Ningxia (China) Regions. Directly connected to domestic telecommunication networks, it's designed to provide the infrastructure and network services necessary to support AWS technology. Amazon AWS provides computing resources and services that you can use to quickly and cost-effectively build applications. For example, you can rent a virtual server on Amazon AWS that you can connect to, configure, secure, and run just as you would a physical server. But the virtual server runs on top of a global network managed by AWS, and [...]

Security 

Web server security is the protection of information assets that can be accessed from a Web server. Web server security is important for any organization that has a physical or virtual Web server connected to the Internet. It requires a layered defense and is especially important for organizations with customer-facing websites. Separate servers should be used for internal and external-facing applications and servers for external-facing applications should be hosted on a DMZ or containerized service network to prevent an attacker from exploiting a vulnerability to gain access to sensitive internal information. Penetration tests should be run on a regular basis to identify potential attack vectors, which are often caused by out-of-date server modules, configuration or coding [...]

How to Use MySQL

Overview This Document explains some of the basic SQL statements. If this is the first time you have used a relational database management system, this tutorial gives you everything you need to know to work with MySQL such as querying data, updating data, managing databases, and creating tables. If you’re already familiar with other relational database management systems such as PostgreSQL, Oracle, or Microsoft SQL Server, etc.,  you can use this tutorial to refresh your knowledge and understand how SQL dialect of MySQL is different from other systems. Section 1. Getting started with MySQL This section helps you get [...]

SAS 

SAS stands for Statistical Analysis Software. It was created in the year 1960 by the SAS Institute. From 1st January 1960, SAS was used for data management, business intelligence, Predictive Analysis, Descriptive and Prescriptive Analysis etc. Since then, many new statistical procedures and components were introduced in the software The core of the SAS System is base SAS software, which consists of SAS language a programming language that you use to manage your data.SAS procedures software tools for data analysis and reporting.macro facility a tool for extending and customizing SAS software programs and for reducing the text in your [...]

SQL

Overview SQL (Structured Query Language) is used to modify and access data or information from a storage area called database. This beginner online training SQL tutorial website teaches you the basics of SQL code and trains you how to write & program SQL queries. I will be sharing my database knowledge on SQL and help you learn programming SQL better. The concepts discussed in this SQL tutorial can be applied to most of the database systems. The SQL syntax used to explain the tutorial concepts is similar to the one used in Oracle database. What is SQL? What is SQL? SQL stands for “Structured Query [...]