Jessica Qiu


Top Stories by Jessica Qiu

Spreadsheet software is widely used in every industry because it is flexible for data computing and analysis. But due to inherent drawbacks, common business spreadsheet software can't run relational queries the way SQL can. A spreadsheet supports visual calculation to some extent, so nontechnical users can perform some rather complex calculations without having to learn SQL. However, the relational query, which is the core of SQL, cannot be implemented in common business spreadsheet software, and this complicates apparently simple multi-table join problems. For example, the Finance department needs to calculate salaries, and the relevant data is stored in the "standard sheet", the "Absence sheet", and the "performance sheet", as shown in the figure below: If these three sheets can be joined, then you can compute it easily via the standard... (more)
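The kind of three-sheet join described in this excerpt can be sketched in plain Python. This is only a minimal illustration, not the article's method: the sheet names come from the text, while the column names (`employee`, `base_salary`, `absent_days`, `bonus`), the sample values, and the salary formula are all assumptions.

```python
# Hypothetical data: the three sheet names come from the excerpt, but every
# column name, value, and the salary formula below are assumptions.
standard_sheet = [
    {"employee": "Alice", "base_salary": 5000},
    {"employee": "Bob", "base_salary": 4000},
]
absence_sheet = [
    {"employee": "Alice", "absent_days": 2},
]
performance_sheet = [
    {"employee": "Alice", "bonus": 300},
    {"employee": "Bob", "bonus": 500},
]


def index_by(rows, key):
    """Build a lookup table from the join key to its row."""
    return {row[key]: row for row in rows}


def join_sheets(standard, absence, performance, key="employee"):
    """Left-join the absence and performance rows onto the standard sheet."""
    absence_by_key = index_by(absence, key)
    performance_by_key = index_by(performance, key)
    joined = []
    for row in standard:
        merged = dict(row)
        # Employees missing from a sheet default to 0 absent days / 0 bonus.
        merged["absent_days"] = absence_by_key.get(row[key], {}).get("absent_days", 0)
        merged["bonus"] = performance_by_key.get(row[key], {}).get("bonus", 0)
        joined.append(merged)
    return joined


def salary(row, deduction_per_day=100):
    """Assumed formula: base salary plus bonus minus a flat daily deduction."""
    return row["base_salary"] + row["bonus"] - deduction_per_day * row["absent_days"]


for row in join_sheets(standard_sheet, absence_sheet, performance_sheet):
    print(row["employee"], salary(row))
```

Once the sheets are joined on a shared key, the salary becomes a single row-wise expression, which is the simplification a relational join provides over per-sheet lookups.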

Database to Implement Big Data Real-Time Application

A Big Data real-time application is a scenario in which computation and analysis results must be returned in real time even when the data volume is huge. This demand on database applications has emerged in recent years. In the past, because there wasn't much data, the computation was simple, and there was little parallelism, the pressure on the database wasn't great. A high-end or mid-range database server or cluster could allocate enough resources to meet the demand. Moreover, in order to access the current business data and the historic data rapidly and in parallel, users also tended t... (more)

Data Environments Support of esProc Makes Statistical Computing More Flexible

Enterprises usually have a variety of data sources: for instance, the CRM system may use SQL Server, sales reports may be kept in Excel, and the ERP system may run on an Oracle database. When it comes to actual business analysis, enterprises often need to perform interactive computation, including filtering, grouping, and so on, across these data environments. But data interaction between multiple data sources is not easy to achieve with some traditional statistical computing tools. To solve this kind of problem, esProc, which adapts to various data environments, came into being. Support of various data sources is an... (more)

An Example to Illustrate Hadoop Code Reuse

Hadoop's MapReduce is a widely used parallel computing framework. However, its code reuse mechanism is inconvenient, and passing parameters is quite cumbersome. Unlike our usual experience of simply calling a library function, I found that both the coder and the caller must keep a sizable number of precautions in mind when writing even a short piece of code for others to call. However, we eventually found that esProc could easily realize code reuse in Hadoop. Taking again a simple and understandable example of grouping and summarizing, let's check out a solution with... (more)

Which Tool Will You Choose for Java: Hibernate, esProc, SQL, iBATIS or R?

The data computation layer, which sits between the data persistence layer and the application layer, is responsible for computing on the data from the data persistence layer and returning the result to the application layer. The data computation layer of Java aims to reduce the coupling between these two layers and to take over their computational workload. A typical computation layer is characterized by the following features: Ability to compute on the data from arbitrary data persistence layers, not only databases, but also the non-database Excel, Txt, or XML files. Of all these computations, the... (more)