Developping for the translation industry RSS 2.0



 Wednesday, 19 December 2007

Jeff Atwood recently posted on the topic of sorting strings in a more natural order that the default string sorting gives us.  His main point is, when we’re dealing with numbers, a simple alphabetic sorting doesn’t cut it.

So. just for fun, here’s my little C# 2.0 Implementation of a more human-friendly sorter.

 

public class FriendlySorter<T> : IComparer<T>

{

    public int Compare(T x, T y)

    {

        if (x == null || y == null) return 0;

        if (x.ToString() == y.ToString()) return 0;

 

        string[] left = Regex.Split(x.ToString(), "([0-9]+)");

        string[] right = Regex.Split(y.ToString(), "([0-9]+)");

 

        int index = 0;

 

        while (true)

        {

            while (left[index] == right[index]) index++;

 

            if (left.Length == index && right.Length == index) return 0;

            if (left.Length <= index) return -1;

            if (right.Length <= index) return 1;

 

            try

            {

                return (int.Parse(left[index]) - int.Parse(right[index]));

            }

            catch

            {

                int CompareResult = String.Compare(left[index], right[index]);

                if (CompareResult != 0) return CompareResult;

            }

 

            index++;

        }

    }

}

 

The following code :

List<string> list = new List<string>();

 

list.Add("Track 1 : abc");

list.Add("Track 2 : abc");

list.Add("Track 10 : abc");

list.Add("Track 5 : abc");

list.Add("Track 11 : abc");

list.Add("Track 22 : abc");

list.Add("Track 9 : abc");

 

list.Sort(new FriendlySorter<string>());

 

Will produce the results :

Sshot-4

For small sets the performance is Ok.  But when dealing with a larger amount of data (1000+ items), the sorting performance drops a lot (I currently have 5 seconds to sort 1000 random elements).  I will probably try to enhance it in the future and update this post.

Happy holidays !

Tags:
Wednesday, 19 December 2007 18:42:12 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
.NET | C# | Code Snippet
 Wednesday, 28 November 2007

Visual Studio 2008 is a great tool.  But there is one very annoying point. The bad news of the day is that, if you tried the pre-release version of Visual Studio 2008, there are a whooping 22 steps needed to un-install it.

Needless to say that it made me really cranky when I found about this.  I guess it's the price to pay when you are an early adopter..

Wednesday, 28 November 2007 11:20:34 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
General
 Thursday, 22 November 2007

Some complex queries involving more than one layer of calculations cannot be easily done in SQL.  Let’s say you need to compute the average number of sales per customer.  Your first thoughts may be something in the lines of :


SELECT AVG(COUNT(SALE_ID)) FROM SALES GROUP BY CUSTOMER_ID

But when you try to execute this kind of query, you will get an error message looking like this :

Cannot perform an aggregate function on an expression containing an aggregate or a subquery.

At this point, your instinct should tell you that the average of sales per customer should be something relatively easy to compute using SQL.  Some folks use temporary tables as a workaround to this limitation.  This solution is very bad for your query performance, especially when you have to deal with large databases.  The flow for such a task is :

1– Create a new temporary table to hold the computed values.

2– Insert the values in the new table.

3– Select the value you need from the temporary table.

4– Delete the temporary table.

The easier way that’s a lot faster both to write and to execute is to use derived tables.


SELECT AVG(number_of_sales) FROM (SELECT COUNT(SALE_ID) AS number_of_sales FROM SALES GROUP BY CUSTOMER_ID) dt_sales


The magic here lies in the FROM part.  The FROM keyword can use tables OR derived tables.  A derived table is simply created by writing the query and giving it an alias (in my examplen it is dt_sales).  Derived tables are created only in memory and thus are a lot faster than on-disk temporary tables.

Derived tables can be used exactly as regular database tables.  For example, you can make joins between them or between regular tables and derived tables.

Tags:
Thursday, 22 November 2007 13:13:09 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
SQL
 Tuesday, 23 October 2007

There are so much “productivity hacks” out there.  The blog lifehacker is full of little productivity hacks to boost your effectiveness.  For developers we have more stuff to look at that every one else combined.  This is not an article about some yet-another-incredible-new-tool, because there will be a better one next week.  I’ll have plenty of time later to talk about my favorite tools that are great Time Savers. But IMHO, there is no tool yet that is as high a productivity booster as owning whole features in an application.

The current problem with any tool is that you can have a productivity increase of 1.5x – 2x.  Maybe you can make that number 3x - 4x if the tasks you have to accomplish are repetitive, you are a master of your particular tool and you take the time to enhance it, creating your own snippet or macros.  The thing about it is that, with most these kind of tools, you just become a faster typist.  Since the number of lines of code is a terrible productivity metric, we can assess that being a faster typist doesn’t necessary make you a more productive developer.

The real productivity gains lies elsewhere.  It has been given many names : Being “in the zone”, “in the moment”, “in trance”.  Everybody has experienced this state of mind where time seems to stop, you lose awareness of everything around you and you wake up, several hours later, having done a week’s worth of work.  This state of mind is one of the main points of the classic software project management book Peopleware.  Getting into that state of mind can take from 15 minutes up to a full hour or even more depending on the individuals and many others parameters (ambient noise, stress level).  And getting out of that state of mind is terribly easy. 

But what’s all this have to do with developers owning some features?  Simply that owning some code will make the “in-the-zone-boot-up-time” a lot shorter when working on that particular piece of software.  Basically, what’s happening when you are in the process of getting in the zone?  Your subconscious mind is loading (or, at least, tries to load) the entire problem space that you’re facing.  When there are unknown elements, as there are always in shared code situation, the problem space is filled with variables elements, making the process really harder than if you know everything about the problem.

When you own features, you have seen (and probably written) every single line of code and you know every bit of functionality, every hack that shouldn’t be there and every little design flaw.  So when you work on that feature, the feature problem space contain only known elements and is a lot easier to load in your subconscious mind.  Moreover, it helps a lot for debugging and refactoring situations to know the entire feature characteristics.  The developers that own their features will avoid a lot more pitfalls than those who share functionnalities, simply because they know the implications everywhere of the code they’re writing.

Owning features is not only beneficial on the developer’s performance side.  Developers in general are elitist people and they don’t like others developers messing with their code.  The fact that it is clearly stated that features X, Y and Z are properties of Joe Developer will surely make him not only happier on the job but he will be a lot more committed to those features than he could ever have been otherwise.

But that’s just me, what’s your opinion on this subject?

Tuesday, 23 October 2007 08:28:49 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
General
 Monday, 22 October 2007

At the office, we use an Outlook addin (that I originally developed in Visual Studio 2002 targeting the 1.0 .NET Framework).  Finding that technology to be a little obsolete (all of our others internal applications are on the 2.0 Framework), we’ve decided to migrate it to 2.0 and add some news features while we’re at it.

The migration and installation went fine at first on my Vista development PC but I couldn’t get the addin to load on the users computers (that are on WinXP).  Every thing I tried, didn’t work and the addin couldn’t load at all.  What I ended up doing is a complete rebuild of the application using VSTO SE (Visual Studio Tools for Office Second Edition).  Since everything works well now, I’ll share with you every resource that I came across about creating an addin with VSTO.

1. Get the prerequisites:

In order to deploy a VSTO addin, you need the following prerequisites installed on every user’s computer:

 

2. Code the addin:

Pretty straightforward step, depending on your company’s requirements.  I’ll write another article about that part soon with some code examples.

 

3. Prepare for deployment:

In order to prepare for the deployment of your solution, you need to set the code access security policy for your assemblies.  To do so, you need to do the following tasks:

  • Get the “SetSecurity” project here and add it to your project.
  • Sign your assembly (or assemblies) in the Signing section of the project’s properties.  All you need to do is to check the Sign the assembly check box and create a new strong name file.  You need to make sure that every assembly that you deploy is signed (including the SetSecurity.dll assembly).

 Sshot-1

Sshot-02

 

  • Add the SetSecurity project output to your deployment project and add a custom action for Installation, Rollback and Uninstall.

Sshot-3

In the custom action data for the Install custom action, you need to enter the following data, replacing “OutlookAddin” with the name of your addin:

/assemblyName="OutlookAddin.dll" /targetDir="[TARGETDIR]\" /solutionCodeGroupName="MyCompany.OutlookAddin" /solutionCodeGroupDescription="Code group for OutlookAddin" /assemblyCodeGroupName="OutlookAddin" /assemblyCodeGroupDescription="Code group for OutlookAddin" /allUsers=[ALLUSERS]

For the Rollback and Uninstall, all you need to provide in the custom action data is:

/solutionCodeGroupName="MyCompany.OutlookAddin"

 

That’s a wrapper of about everything useful I found in my research.  I hope this helps you!  If there is something you think would be a good addition to this article, feel free to email me at: stan AT stanbiron DOT com.

Some more useful resources:

  • If you don’t have VSTO as part of your Visual Studio Installation, here is the link to download it separately.
  • If you code on a Vista PC and you use Visual Studio 2005 (with the Vista service pack), you won’t be able to debug into a VSTO addin for Office 2003.  Microsoft has issued an hotfix to resolve this problem.
  • If you can’t get the addin to work on the target computer, you can use this really great Outlook Addin Checker that looks for missing registry keys, prerequisites or Code Access Security policies.
  • There is a two-part article on MSDN for every step needed to deploy an VSTO addin or document customization: part 1 and part 2.  These article are somewhat long but every bit of information for the deployment part is there.  They also explain how to set custom deployment conditions to ensure that the prerequisites are installed.

 

Monday, 22 October 2007 16:14:29 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
General | Office Interop | Tools
 Wednesday, 10 October 2007

It’s a shame how many applications, even popular ones, have huge security gaps regarding passwords.  The most recent case to hit the news was Reddit but this is only the tip of the iceberg.  How many internals applications out there uses plain text password.  A whole lot!  You probably did it at some point, and so do I.  But the object of this post is not to rant and whine about that situation.  I just want to show those who don’t know about it how easy it is to integrate Windows Active Directory authentication to your application.  So now, if you read this my blog, no more excuses to have that “password” field in your database.

First you need to add a reference in your project to System.DirectoryServices

Here is the code, I will catch you up on the other side.

try

String Username = "username";

String domainAndUsername = "domain\\username";

String Password = "password";

 

DirectoryEntry entry = new DirectoryEntry("", domainAndUsername, Password);

DirectorySearcher search = new DirectorySearcher(entry);

 

search.Filter = "(SAMAccountName=" + Username + ")";

 

// search.FindOne() will throw an exception if there is a bad username/password combination provided

SearchResult result = search.FindOne();

 

// User is authenticated

catch 

    throw new System.Security.SecurityException("Access denied.");

End Try

Pretty straightforward isn’t it?  Obviously, you need to change the first lines with the user’s input but other than that, it’s all that’s needed for a basic username/password authentication using Active Directory.  You can copy this code and use it in your application and see for yourself.  If you want more information on what you can get from Active Directory, there is a good article on the different name attributes here.  You can also go to the homepage of System.DirectoryService on MSDN here.

I want to add that I’m not an Active Directory expert.  If a reader see something wrong with this code, please let me know and I’ll update it right away!

Other Posts:

How to create user accounts in active directory using C#

Wednesday, 10 October 2007 10:59:42 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
C# | Code Snippet
 Wednesday, 03 October 2007

It’s simply amazing! Scott Guthrie announced today that the .NET framework source code is going to be released later this year.  He says it will be released at the same time that Visual Studio 2008 and the 3.5 Framework are going to be released.  This means that you will be able to drill down inside the framework when debugging your applications in Visual Studio 2008.

This is a bold move from Microsoft’s part by any standards.  I think the motivation behind this release is two-fold:

1. This seems to be part of a larger company-wide politic to show to developers that Microsoft genuinely care about them.

2. They also need to work with the flow.  More and more, tools like Lutz Roeder’s .NET Reflector (a utility and Visual Studio plugin to decompile and visualize .NET assemblies) are becoming popular and the Framework is, for all intent and purposes, already opened and ripped apart.  I look at classes of the framework myself using this tool from time to time.

All that being said, this is really great news and I can’t wait to play with this and see the internals of the framework “in action”.

Click here to view the full article from Scott.

If you don’t know who Scott Guthrie is..  Well he’s the team leader for :

  • IIS
  • ASP.NET
  • The AJAX Toolkit
  • The CLR
  • The Compact Framework
  • Windows Forms
  • Commerce Server
  • Visual Web Developer 2005
  • Visual Studio Tools for WPF

In others words, if you love your job as a programmer using Microsoft’s technologies, it’s because of this guy.

Tags: ,
Wednesday, 03 October 2007 14:37:44 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
General | .NET | News
 Tuesday, 02 October 2007

If you haven’t already heard about it, (very unlikely since it has been all over the news) there is a bug in Microsoft’s Excel 2007 that display the wrong result when doing a multiplication.  The bug first shown up with the multiplication of 77.1 * 850 but the Excel team blog tells us that there are 12 specific calculations that are subject to this flaw.

Basically, this is a problem with the internal representation of numbers in Excel that gets translated for displaying.  It’s a conversion bug in the UI part so it won’t affect any calculation, charting or other number analysis on those numbers.  VBA script and COM Interop are also immune to this bug because they use the internal representation and not the number actually rendered on the Excel interface.  Joel of joelonsoftware.com has a really good article that explains the specifics of this bug. 

Every forum and blog posts that I've seen out there, with the exception of Joel’s one, shouts that this is a major/critical bug and start bashing on Microsoft for their “gross incompetence”.  My humble opinion on this is that those guys are going totally overboard and see critical problems when there are, in fact, none.  Since the bug we’re talking about affect only the number displaying but not the internal representation it will affect people’s life only if they do those exact calculations and don’t use the calculation result to do anything else.  The chances of that happening out there exists, I agree.  However, they are so slim that this bug should not be considered critical at all.  Microsoft will probably release a hotfix in the next days/weeks and it will all be history soon.

UPDATE: The Excel team has released a hotfix to correct this issue.  The fix can be found here.

Tuesday, 02 October 2007 14:45:18 (Eastern Standard Time, UTC-05:00)  #    Comments [0] -
General

Navigation
Advertisement
About the author/Disclaimer

Disclaimer
The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way.

© Copyright 2018
Stanislas Biron
Sign In
Statistics
Total Posts: 135
This Year: 0
This Month: 0
This Week: 0
Comments: 1
All Content © 2018, Stanislas Biron