scalability

Performance impact of having a data access layer/service layer?

I need to design a system which has these basic components: a web server which will be getting ~100 requests/sec and only needs to dump data into a raw data repository; a raw data repository which has a single table that gets 100 rows/s from the web server; a raw data processing unit (simple processing, not much. Removing invalid...

Scalability of a single server for running a Java Web application

I want to gain more insight regarding the scale of workload a single-server Java Web application deployed to a single Tomcat instance can handle. In particular, let's pretend that I am developing a Wiki application that has a usage pattern similar to Wikipedia's. How many simultaneous requests can my server handle reliably before going o...

What is best practice technique for scalable, routine processing of a large user base?

Suppose you have a database of users, which is constantly moving, and getting big. You want to do something to each user, every 24 hours, without fail. It doesn't matter what that something is, maybe send them each an email, or maybe something more complex that takes an unpredictable length of time. For a small site, I can imagine a sim...

db.get() vs db.get_by_key_name() performance (GAE Bigtable/Datastore)

How does the performance of db.get() compare with that of db.get_by_key_name()? ...

Should we start with multiple small-grained databases for an app that may scale massively?

We're developing a new eCommerce website and are using NHibernate for the first time. At present we are splitting our data into multiple SQL Server databases, divided per area of functionality. So we have one for UserInfo, one for Orders, one for ProductCatalogue and so on... Our justification for this decision is twofold really: th...

Case studies or examples of high throughput services with highly dynamic data

Hello everyone, I'm looking for some architecture ideas on a problem at work that I may have to solve. The problem: 1) Our enterprise LDAP has become a "contact master" filled with years of stale data and unused and unmaintained attributes. 2) Management has decided that LDAP will no longer serve as a company phone book. It is for...

How to achieve scalable code

How can one achieve scalable code? It may be better to frame the question as "What do you mean by scalability of code?" and how to determine the extent to which code is scalable. Waiting for positive replies. Thanks in advance ...

How to gear towards scalability for a startup e-commerce portal?

I want to scale an e-commerce portal based on LAMP. Recently we've seen a huge traffic surge. What would be the steps (please mention in order) in scaling it: Should I consider moving onto Amazon EC2 or similar? What could be potential problems in switching servers? Do we need to redesign the database? I read, Facebook switched to Cassandra fro...

Why do I have to redefine a data structure twice? Should I?

After carefully choosing data types, a complex class is finally created; then, using a HashMap, objects of that class get mapped to keys, and now I can create thousands of them and access, add, delete, etc. Now if I want to store them in a DB, do I start from scratch? Do I have to create the table and again take care of the data types? to make fit...

Should someone with no PHP experience use a framework like CakePHP or Symfony?

I have a simple site to develop and would like to learn PHP as I go. I want the site to be secure, scalable, and easy to maintain. Should I learn a framework and PHP simultaneously? If I build off of a framework there will be lots of unfamiliar code in play. Would you say this increases security risks? ...

Database architecture for millions of new rows per day

I need to implement a custom-developed web analytics service for a large number of websites. The key entities here are: Website, Visitor. Each unique visitor will have a single row in the database with information like landing page, time of day, OS, browser, referrer, IP, etc. I will need to do aggregated queries on this database s...

File system scalability options

Hi, we have an NTFS volume used for storing a large number of files (currently at 500GB and growing). It is currently accessed via a file share by a single application. We are looking at options to scale out the file server for access by multiple applications. The other applications would only be reading files and not performing any...

Is it recommended to use the ASP.NET user management system on a high traffic website?

I am reprogramming a high traffic website in ASP.NET MVC. The client gets about 5 million hits a day. I would like to know whether the ASP.NET user management system is suited for this. The site contains over 500K users. ...

Table too big in MySQL database

Hi. I am making an online game, and one table in my database, maybe the most important one, is starting to become very large. I am creating a function where you can go together with multiple users, and for that I have a table that looks like this so far: CREATE TABLE `oc` ( `id` int(11) NOT NULL auto_increment, `leader` varchar(40) N...

Scalability of J2EE web services, how to approach?

Hello, I have this web client (built entirely through a JavaScript Ajax framework) that communicates with the middle tier via web service calls (3 web services are deployed on JBoss, 1 web service is deployed on Tomcat). All of the web services access the same Oracle database. The web services are built through Axis2 and Spring. All th...

Short unique values with Amazon Web Services

I'm working on a system conceptually similar to a URL shortener like bit.ly -- a client sends in a bit of data and the system returns a very short URL to represent it (a base URL + a token representing the data). Ideally, it would start with the shortest token possible (maybe 4 or 5 alphanumeric characters) and move on to longer tokens ...
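One common way to get tokens that start short and grow only when needed is to base-62 encode a monotonically increasing counter. The following is a minimal sketch under assumptions not stated in the question: the counter value `n` is assumed to come from some shared, atomically incremented source, and `ALPHABET`, `encode_token`, and `decode_token` are hypothetical names chosen for illustration.

```python
import string

# Assumed alphabet: digits, lowercase, then uppercase letters (62 symbols).
ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase
BASE = len(ALPHABET)


def encode_token(n: int, min_length: int = 4) -> str:
    """Encode a non-negative counter value as a base-62 token.

    The token is left-padded to min_length, so the first 62**4 values
    all fit in 4 characters and later values grow to 5 or more.
    """
    if n < 0:
        raise ValueError("counter value must be non-negative")
    chars = []
    while n:
        n, rem = divmod(n, BASE)
        chars.append(ALPHABET[rem])
    token = "".join(reversed(chars))
    return token.rjust(min_length, ALPHABET[0])


def decode_token(token: str) -> int:
    """Recover the counter value from a token produced by encode_token."""
    n = 0
    for ch in token:
        n = n * BASE + ALPHABET.index(ch)
    return n
```

With a 4-character minimum, about 14.7 million values (62**4) are available before tokens grow to 5 characters. How the counter is generated and kept unique across machines is the hard part and is left open here.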

Thoughts on developing a scalable push notification system from scratch

Hey all, I'm trying to make a push notification service, mainly as a learning experience (so please do not start suggesting open/closed source alternatives). Being a hardcore Win32 C++ developer, I don't have a lot of web dev knowledge. What I'm trying to do: Expose an API (over http(s)) to send push notifications to a certain devic...

PHP: MySQL-based session sharing on different machines

Hi all, I have an app that sits on machine A (solaris.example.com) and the same app sits on machine B (rhodes.example.com); these two machines are connected to the same MySQL database, which has a session table in it. I've implemented my own session handler so it saves to the database instead of saving to files, and this works fine. My questio...

Most efficient way of getting the next unused id

(related to http://stackoverflow.com/questions/3439571/finding-the-lowest-unused-unique-id-in-a-list and http://stackoverflow.com/questions/2116056/getting-unused-unique-values-on-a-sql-table) Suppose I have a table containing an id column and some others (they don't make any difference here): +-----+-----+ | id |other| +-----+-----+ ...
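To make the problem concrete, here is a minimal in-memory sketch of finding the lowest unused id. It assumes the existing ids have already been read out of the table into a Python collection, which may not be acceptable for a large table; `lowest_unused_id` and its `start` parameter are hypothetical names, not something from the question.

```python
from typing import Iterable


def lowest_unused_id(used_ids: Iterable[int], start: int = 1) -> int:
    """Return the smallest integer >= start that is not in used_ids.

    Builds a set for O(1) membership checks, then walks upward from
    start until it finds a gap. Runs in O(n) time and O(n) memory.
    """
    used = set(used_ids)
    candidate = start
    while candidate in used:
        candidate += 1
    return candidate
```

For example, `lowest_unused_id([1, 2, 4])` returns 3. Note that this says nothing about concurrency: two callers could compute the same id unless the lookup and the insert are done under a lock or inside one transaction.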

Are there any pitfalls with having multiple MySQL databases on 1 server node?

I want to prepare my database for sharding in the future. But right now I only need 1 node. What I'd like to do is to shard the database into virtual shards, say 12 shards, and put all of them onto the same server node for now. And when I need to scale out, I could just then get a 2nd server and move 6 of the 12 virtual shards onto the ...
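The virtual-shard idea described here can be sketched as a two-step lookup: a key hashes to one of a fixed number of virtual shards, and a small table maps each virtual shard to a physical node. This is only an illustration under assumptions: the node names `node-1` and `node-2`, the CRC32 hash, and the helper names are hypothetical and not taken from the question.

```python
import zlib
from typing import Dict, Iterable

NUM_VIRTUAL_SHARDS = 12  # fixed up front; changing it later remaps keys

# Initially every virtual shard lives on the same (hypothetical) node.
SHARD_TO_NODE: Dict[int, str] = {s: "node-1" for s in range(NUM_VIRTUAL_SHARDS)}


def virtual_shard_for(key: str) -> int:
    """Map a key to a virtual shard using a stable hash (CRC32)."""
    return zlib.crc32(key.encode("utf-8")) % NUM_VIRTUAL_SHARDS


def node_for(key: str, shard_to_node: Dict[int, str]) -> str:
    """Look up the physical node that currently holds the key's shard."""
    return shard_to_node[virtual_shard_for(key)]


def move_shards(shard_to_node: Dict[int, str],
                shards: Iterable[int], new_node: str) -> Dict[int, str]:
    """Return a new mapping with the given virtual shards reassigned."""
    updated = dict(shard_to_node)
    for shard in shards:
        updated[shard] = new_node
    return updated


# Scaling out: move 6 of the 12 virtual shards to a second node.
scaled_out = move_shards(SHARD_TO_NODE, range(6, 12), "node-2")
```

Because keys map to virtual shards rather than directly to servers, scaling out changes only the shard-to-node table; the key-to-shard function stays the same. Moving the underlying data for each reassigned shard is a separate step not shown here.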