Skip to main content

Posts

SQL Interview Question

  SQL Interview Question: You are given two unrelated tables: Product — containing product details ProductSubcategory — containing subcategory details There are no common columns , and no foreign key or primary key relationships between them. Task: Write a SQL query to perform a join between these two tables, despite having no direct relationship. Query And Related Table: The problem with the above script is when you have exact match between the "product description" column and "subname" column then this code will work, but let say you have difference like Gadget and Gadgets then in this case the query will fail.  Feel free to post you comments over this solution, my approach would be: 1) Do the match over the letter by letter then count the total letter matched and then total letters if the percentage for this is above 80% then this record must be in the join condition. Basically perform the lookup with the help of SQL code. 2) Make abridge...
Recent posts

HTTP request to get data import into Database

What is HTTP request? It's a logic that can be used in order to pull data using api's created by individuals both private and public. Some API's are build in house to use it for own use and some are public API's, there are series of methods that can used in order to fetch data using these public API's. Let's see some of them: 1) GET: it is used to request some data from the frontend, usually the website URL, access token is required and in the URL itself the path is specified that someone is looking for. Let's understand with an example:- For a college website xyzuniversity.com, if I need to call the record for a student name "Alex" I will be writing the URL like GET http://university.com/student-record/alex:? And in the response I will be getting the associated record in JSON format. Here we used the GET method to call this data, like this we got multiple methods like PUT, HEAD, DELETE, PATCH,OPTIONS etc that can be used based on the need. Let...

How to be a DATA Analyst

To become a data analyst one need to be good at maths, basically numbers and visuals are two things every data analyst must know about. In my experience I believe domain knowledge and understanding the business is one of the key factor one would be needing to sort out pattens or analysis from the business data. Until and unless I don't know what my details is telling to me how I can be sure what to analyse. After all these basically a series of tool is important so that the work of analyst becomes easy, let's say tools like Excel, SQL , visualisation (tableau, powerbi), cloud computing (azure,AWS), modules in python like matplotlib, scikit learn, seaborn, pandas are some of the basic necessities that need to be fullfill. Some guidelines if I have to say is: 1) always try with clean data, then move to dirty data(mostly wrong columns values, mismatch column values, redundant data) 2) making quick visuals are always a better approach to increase confidence and skill in the path 3)...

Database Design and development

Database Design For any business process where the data need to be stored for later analysis or just for maintaining inventory or record, one need to maintain the day to day transactional data management.  For small business any approach used will work, like maintaining sheets, or csv all will work, but for mid scale or large scale companies data management is important to be done in database, since data could be fetched from different sources sometime it could be any frontend website or it may be any mobile application or computer software etc. Management of these set of information in a structured or say tabular format is known as database management.  Basically it categories in two types relational database and nosql database. For relational database software used are like SQL server, oracledb, azureSQl, MySQL etc. For nosql database we have format files file json or XML etc. Generally relational database management system is something which is widely used because of its re...

Patching in windows

What is Patching or Patch Management? Let say multiple system are installed in a organisation and all are connected to same domain controller(windows server 2012,2016,2024) and let say client system have win8 or win10 or win11. Let say all client are connected to domain controller and are domain member. Time to time windows do some updates in the operating system.So these updates need to be installed on the client systems.Patching or patch Management is the management of these updates on the client system.  Any provider like Microsoft, apple or Linux when create any operating system it tooks a lot of time in behind so every months or two they can not design new operating system so basically they introduce new features in the old operating system only, rather then building new operating system with new features, so what these organization's do is create patches and these upgrades are automatically being loaded to the windows all around the world. Obviously there are background serve...

Cardinality

What is cardinality? To understand this concept first need to know about relationship in database entities. When two or more table share Primary key and foreign key relationship that is a common column between them and with refrential integrity rule which tells the tables share dataset which can drilled down in both to get more shared information, can be termed as relationship between two tables. From relationship one cannot know how many times the data can be fetched in the foreign key table, or how much data is available in the foreign key table with respect to the first table, all such information can be given or tells by cardinality between tables. Cardinality basically tells the measure of participation of one table to second table. For example let say there are two entities named employee and department, in such case if the business rule said the employee can take part into one department only so it states the cardinality from employee to department is (1,1). But one department c...

Microsoft Outage for hours

What is outage? When an application or server fails to act may be because of hardware or software bugs then the application connected to it shows an error basically termed as outage. A general example could be we lost connection to our TV's in case of heavy rainfall or some disaster, here the cause is the servers managing the tv channel didn't find proper connectivity and it goes to outage that is no connection established. Same happens with the azure cloud services yesterday which leads to outage of server for couple of hours. Who was responsible? Basically it's not Microsoft who is solely responsible, the company name Cloud strike is a leading cyber security company which mange and control security on azure cloud, this company install an update on there software without testing it thoroughly, this is what everyone saying. But the point is in technical aspect of any software design, without being tested the application cannot go live with newer version. So basically it...

Data modelling

What is data modelling! A key concepts while building a fresh new database, one need follow certain guidelines which allows the proper and efficient structure of the database. These guidelines include a series of steps divided in basically three steps: 1) conceptual step 2) logical step 3) physical step As the name suggests DDLC cycle which consist of requirement,design, development,testing, implementation is divided in these three steps. Benefits are like one would have the proper hierarchy in the project life cycle, blueprints will be created by default since documentation is done at each step.

SQL query Optimization

What comes in mind when one think about optimization of SQL queries! Basically it requires the skills and resources to write queries on big size database or data warehouses. Why even sometimes optimization is leading challenge??? Let's dive into the core of optimization and see what we got there! If I am writing a select query to the server where multiple tables are involved so basically queries depends on the IO cost as well as the transformation operators applied to the raw data or if aggregation is involved. Sometime simple select query like fetch top 100 record from the table, people trying to optimize it. But that's not the issue actually. Problem arises when multiple technologies are connected to database and same time everyone needs efficiency so what should the optimal use to pull data. Let's try to understand with an example: Let say I have an regular API post request to the database table for filling out my orders or let say any transactional data, in such case, i...

Azure Resource Manager

Azure resource manager, organise azure services for individual project or user based on the day to day requirement. With an example if try to understand how's it's significant. Let's see! User A wants to pull data from chats happened in the YouTube channel everyday for live sessions as well as for the offline videos. Here to pull this data User A have managed to Resource manager and configure the pipeline separately for both, so may be end destination for the comments could be same but still the processing of data is carried in two seperate pipelines managed under two resource groups. Advantage:  In these two scenarios resources can be allocated by discussion and requirements for separate process. So costing would be charged differently for both use case and that would be efficient that way. If one resource .anager is configured for both cases then obviously costing would be same whether those configurations are even required or not. Secondly maintenance and debugging would...

Cloud Computing

What is Cloud computing ! In today's era nearly every industry need analytics and reporting for there organisation in order to stand in the competition as well as to cross verify their decision or to take new decision in day to day business processes. For example a pharmaceutical company if going to develop a new medicine whose alternates are already available in the market so before going to production obviously the organization wants to test the market and analyse how much they can stand and capture the market with this new medicine. To do this obviously data analyst will find some pattern analysis, or market fit for the product, marketing strategy evaluation and prediction all that will be done. To do these analysis, we need to process huge dataset everyday and after processing sometime that data even is not required, building server's inhouse for this process would be highly expensive as well not feasible in long running.  Cloud computing here comes in kuch handy way, and w...