Re: primary key - Database Discussion Boards

CitrusTech, 23-Jul-10 21:11

Take a look at http://dev.mysql.com/tech-resources/articles/hierarchical-data.html; the section about halfway down the page looks similar to what you're asking.

Data Quality and Data Profiling software

Table Optimization Problem - SQL Server

Jacobus01, 23-Jul-10 2:36

Hi All,

I currently have a problem with two tables in SQL Server 2005, PrintArticles and PrintMedia. The tables have grown very large and sometimes lock up when small amounts of data are requested from them. The current record counts are 9731495 (PrintMedia) and 4126084 (PrintArticles). What's worse is that these two tables are joined to each other in SQL queries. This is because the database is non-relational, so joins are used to substitute for foreign keys. The performance needs to be improved. The SQL Server version is 2005 Standard Edition, so partitioning is not possible :(

Here is a breakdown of the Tables with their fields, types and Max sizes:
PrintArticles:

Analysis int NULL
width float NULL
height float NULL
ArtSize float NULL
ID (PK) int NULL
Cutting text 2147483647 (this field contains 3000 characters on average)
CutID int NULL
NewbaseNo varchar 20
AnalysisApplic int NULL
AnalysisUser nvarchar 50
Translations ntext 1073741823
AnalysisApplicDate datetime NULL
AnalysisApplicTime nvarchar 5
PerTranslation text 2147483647
PerTranslated int NULL

This table also has 3 indexes: 1 clustered on the PK and 2 non-clustered.

PrintMedia

PrintID (PK) int NULL
CuttID int NULL
Graphic int NULL
FpageSection int NULL
Caption varchar 1000
PDFPath varchar 300
Branch varchar 30
UploadTime varchar 10
AnalysisTag varchar 500
UserID varchar 30
Modifier varchar 30
DateModified varchar 10
PubID int NULL
NewBaseArticleNo varchar 20
Edition varchar 800
IS_HardCopy int NULL
Seen_by varchar 300
DTPDate varchar 10
ColourPDF int NULL
SpokesPerson varchar 200
Mention varchar 200
NLPU int NULL
Server varchar 10
Repl text 2147483647
GroupID int NULL
ArticleID int NULL
PubDate varchar 10
CreatedDate varchar 10
Publication varchar 200
SubPublication varchar 200
Headline varchar 800
SubHeadline varchar 800
Journalist varchar 500
SubJournalist varchar 500
Page varchar 20
Client varchar 50
Category varchar 200
CategoryValue varchar 200
CategoryDisplayName varchar 200
OrderID varchar 5
TagID varchar 5
Language varchar 20
Section varchar 200
CCM real NULL
SizeX real NULL
SizeY real NULL
RandValue real NULL
FrontPageCover int NULL

This table has 12 non-clustered indexes.

I have to restart the SQL Server service at least once a day to remove locks. If that fails, I have to reorganize or rebuild the indexes to get the tables working again.

The previous developers wrote applications that use ad hoc queries. The normal CRUD operations and the tables worked fine, but it seems scalability was not considered. Will it help if I modify all the select statements in these apps to not take locks, e.g. "select [columns] from [tablename] with (nolock) where..."? I just need these tables to perform faster and not lock up.

Please advise.

Re: Table Optimization Problem - SQL Server

Eddy Vluggen, 23-Jul-10 5:06

Jacobus01 wrote:
Will it help if I modify all the select statements in these apps to not take locks, e.g. "select [columns] from [tablename] with (nolock) where..."?

It feels faster, doesn't it? The documentation on MSDN reveals why it's faster:

MSDN:
Do not issue shared locks and do not honor exclusive locks. When this option is in effect, it is possible to read an uncommitted transaction or a set of pages that are rolled back in the middle of a read. Dirty reads are possible. Only applies to the SELECT statement.

That means that you could be reading data that's not committed. I'd be very careful if the database contains a lot of stored procedures and uses transactions a lot.

How are the indexes, and what does the profiler say?

I are Troll :suss:

Re: Table Optimization Problem - SQL Server

Mycroft Holmes, 23-Jul-10 14:00

I assume that CutID is the FK between the tables; I'm just wondering why it is not the primary key on one of the tables. That is totally irrelevant to your problem, though.

I would first put in place a nightly maintenance job that cleaned up the indexes.

As to Eddy's issue with uncommitted data, I would assume that queries are filtered, so this dodgy data should not be an issue except where you are specifically getting the latest information. There tends to be a small window of data that is subject to change: the last few records in the table. If you give your users a caveat that it may be dodgy, I see no reason (nolock) should not be used.

CAVEAT - note I have never used (nolock) so take this cautiously.

Never underestimate the power of human stupidity
RAH

primary key

Neha_Gupta, 23-Jul-10 2:28

If a primary key is dropped from a table and some records are duplicated after it is dropped, is it possible to apply the primary key to the same column again? If yes, then how?

NEHA GUPTA

Re: primary key

R. Giskard Reventlov, 23-Jul-10 2:34

No, not really. The whole point of a primary key is that it is unique for each row (be it a surrogate key or a key made up from several columns). If you have duplicate rows then, by definition, you will be breaking the rule. Remove the duplicate rows and then apply a primary key.

Why would you deliberately remove the primary key and add duplicate rows?

"If you think it's expensive to hire a professional to do the job, wait until you hire an amateur." Red Adair.
nils illegitimus carborundum

me, me, me

Re: primary key

Dhyanga, 23-Jul-10 3:01

Yeah, how can you set the primary key again on a column that already has two rows with the same key? It is not possible.

suchita

Re: primary key

Neha_Gupta, 23-Jul-10 3:37

But when we alter the table to add a constraint, there are the options "with check" and "with nocheck". What do those options do? Please clarify.

NEHA GUPTA

Re: primary key

Eddy Vluggen, 23-Jul-10 4:55

If you add a new constraint, SQL Server will test whether or not all of the records that are currently in the table pass that constraint.

Adding the NOCHECK option will skip that test, but it only applies to FOREIGN KEY and CHECK constraints, not to a primary key.

You'd have to remove any duplicate entries to recreate your primary key.
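As a rough sketch of that "remove the duplicates, then put the key back" sequence, here is a small runnable example using Python's built-in sqlite3 module as a stand-in for SQL Server. The `customer` table, its columns, and the rule "keep the first row inserted" are all hypothetical choices for illustration. SQLite cannot add a PRIMARY KEY to an existing table, so a unique index plays that role here; on SQL Server the last step would instead be an `ALTER TABLE ... ADD CONSTRAINT ... PRIMARY KEY` statement.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical table whose primary key was dropped, so duplicates crept in.
cur.execute("CREATE TABLE customer (customer_id INTEGER, name TEXT)")
cur.executemany(
    "INSERT INTO customer VALUES (?, ?)",
    [(1, "Ann"), (2, "Bob"), (2, "Bob"), (3, "Cy"), (3, "Cy again")],
)

# Step 1: list the key values that occur more than once.
dupes = cur.execute(
    "SELECT customer_id, COUNT(*) FROM customer "
    "GROUP BY customer_id HAVING COUNT(*) > 1 ORDER BY customer_id"
).fetchall()

# Step 2: keep one row per key (assumption: the first one inserted wins)
# and delete the others.
cur.execute(
    "DELETE FROM customer WHERE rowid NOT IN "
    "(SELECT MIN(rowid) FROM customer GROUP BY customer_id)"
)

# Step 3: re-impose uniqueness on the key column.
# A unique index stands in for the primary key in SQLite.
cur.execute("CREATE UNIQUE INDEX ux_customer_id ON customer (customer_id)")

remaining = cur.execute(
    "SELECT customer_id, name FROM customer ORDER BY customer_id"
).fetchall()

# Step 4: confirm that a duplicate key is now rejected.
try:
    cur.execute("INSERT INTO customer VALUES (2, 'Bob')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True
```

Which duplicate to keep is a business decision; "first inserted" is only a placeholder rule, and rows that share a key but differ in other columns usually need a manual review before deletion.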

I are Troll :suss:

Database Normalisation

Euhemerus, 22-Jul-10 9:00

I'm currently in the process of creating a service records database for storing information on machine servicing.

In the un-normalised customer table I have three attributes for recording telephone contact details: LandlineNumber, MobileNumber and FaxNumber.

Ordinarily, these attributes should be decomposed through the normal forms: 1NF, 2NF and 3NF.

TELEPHONE_1NF(TelNumber, CustomerNumber, TelNumberType)

TELEPHONE_2NF(TelNumber, CustomerNumber)
TELEPHONE_TYPE_2NF(TelNumber, TelType)

TELEPHONE_3NF(TelNumber, CustomerNumber)
TELEPHONE_TYPE_3NF(TelNumber, TelType)

My question is, in the real world, do you database designers out there ACTUALLY do this, or would you leave the three different telephone attributes in the customer table and invite null entries? Or, do you use a different strategy completely?
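To make the trade-off concrete, here is a minimal sketch of the 3NF decomposition above, again using Python's built-in sqlite3 module. The table names follow TELEPHONE_3NF and TELEPHONE_TYPE_3NF; the `customer` table, the sample names and numbers, and the helper `numbers_for` are hypothetical and only there to make the example runnable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
CREATE TABLE customer (
    customer_number INTEGER PRIMARY KEY,
    name            TEXT NOT NULL
);
-- TELEPHONE_3NF(TelNumber, CustomerNumber)
CREATE TABLE telephone (
    tel_number      TEXT PRIMARY KEY,
    customer_number INTEGER NOT NULL REFERENCES customer (customer_number)
);
-- TELEPHONE_TYPE_3NF(TelNumber, TelType)
CREATE TABLE telephone_type (
    tel_number TEXT PRIMARY KEY REFERENCES telephone (tel_number),
    tel_type   TEXT NOT NULL CHECK (tel_type IN ('Landline', 'Mobile', 'Fax'))
);
""")

# Made-up sample data: customer 2 has no landline and no fax,
# so there are simply no rows for them (and no NULL columns).
conn.executemany("INSERT INTO customer VALUES (?, ?)",
                 [(1, "Acme Ltd"), (2, "Bolt & Co")])
conn.executemany("INSERT INTO telephone VALUES (?, ?)",
                 [("01234 567890", 1), ("07700 900123", 1), ("07700 900456", 2)])
conn.executemany("INSERT INTO telephone_type VALUES (?, ?)",
                 [("01234 567890", "Landline"),
                  ("07700 900123", "Mobile"),
                  ("07700 900456", "Mobile")])

def numbers_for(customer_number):
    """Return (type, number) pairs for one customer, sorted by type."""
    return conn.execute(
        "SELECT tt.tel_type, t.tel_number "
        "FROM telephone t "
        "JOIN telephone_type tt ON tt.tel_number = t.tel_number "
        "WHERE t.customer_number = ? "
        "ORDER BY tt.tel_type",
        (customer_number,),
    ).fetchall()
```

The cost of this layout is visible in `numbers_for`: every lookup needs a join, whereas three nullable columns on the customer table can be read directly. The benefit is that a missing number is an absent row rather than a NULL, and a fourth number type needs no schema change beyond the CHECK list.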

There is only one satisfying way to boot a computer.